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PREFACE 


THE tensor caleulus used in the mathematical treat- 
ment of relativity, and concisely explained by Professor A. 8, 
Eddington in his ‘ Report on the Relativity Theory of 
Gravitation’, is, like the various kinds of vector calculus, 
a system of condensed notation which not only conduces to 
economy in the writing of symbols, but, what is more 
important, enables spatial and physical relationships to 
be grasped as a whole without having to be built up from 
a number of components which really represent views 
from different parts of space. Three-dimensional geometry 
or physics is troublesome enough: the addition of a fourth 
dimension made the need of a condensed notation imperative. 

Professor Eddington has recently pointed out that the 
tensor notation and methods can be applied, with happy 
results, to other and more elementary classes of problems 
than those for which they were originally devised; and 
this book is an attempt to put his somewhat compressed 
exposition into a form in which it may appeal to a larger 
circle of readers. The book, therefore, is not intended as 
an introduction to the mathematical theory of relativity— 
though I hope it may be of some use for that purpose— 
but rather as an exercise in the elementary application of 
methods which, apart from any practical use, possess a special 
beauty of their own. 

The new notation is not introduced until the fifth chapter. 
The properties of determinants, which serve as the starting- 
point for the application of the notation, are familiar to the 
mathematician; but, as I hope the book may be read by 
some who are not entirely at ease with determinants, 
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I have commenced with four chapters on the elementary 
theory of the subject. I make no apology for doing this, 
instead of referring the reader to the ordinary text-book 
on algebra. The text-book treatment is not always stimu- 
lating ; the reasons for the various stages are not necessarily 
clear to the student ; and attempt at simplicity sometimes 
leads to loss of rigidity in proof. In such a subject it 
is necessary to take the reader into one’s confidence; and 
this earlier part may in this respect be found helpful to 
some, teachers or students, to whom the later part makes 
at first a less strong appeal. 

I have added a chapter on some applications to the 
theory of statistics, to which the tensor caleulus seems 
specially suitable. The basis of this portion, so far as 
method is concerned, is a short paper by Professor Eddington, 
mentioned at the end of the chapter. This is one only of 
many possible applications. 

What I have called double sets will be recognized by 
the advanced student as matrices; and many of the pro- 
positions will be found to be familiar. But the tensor 
calculus may fairly claim that, in bringing into close 
relation various branches of mathematical study, previously 
regarded as distinct, it gives them a new life. 

I have to thank Professor Eddington for looking at my 
manuscript and making some corrections and suggestions. 


5 June 1928. W.F.S. 
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INTRODUCTION 


As this is a comparatively new subject to most readers, 
it may be as well to explain briefly what it is about. 


A vector in (say) 3 dimensions is a directed quantity, 
determined as regards both direction and magnitude by its 
components, which are magnitudes measured along three 
definite axes. These axes being supposed to have been fixed 
beforehand, we can take them in some definite order; and 
a vector @ is then determined by a set of 3 quantities, 
which we may call 

Dae, Fm 
Algebraically, the idea of a vector can be extended to some- 
thing which is determined by a set of m quantities 
Ani A ned ce has 
where m may have any value. 
A determinant (say) 


dy Oy +e; 
Aenea, 
Ay Us 6, 


is the algebraical sum of all the products that can be 
formed in a certain way according to a certain rule of signs 
from the set of quantities 


a bh 
Ay by Cy 
G05 C: 


Each ‘element’ of this set has its position fixed in the set 
by the numbers of horizontal and of vertical steps required 
to reach it from the initial element a. Thus the set is 
extended in two directions, while the set which determines 
a vector is extended in one direction only. This applies to 
a determinant with any number m? of elements. 
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In the same way we might have a set extended in 3 direc- 
tions, the symbols being written along the edges of a cube 
and along lines inside the cube or on its faces; and we 
could, in theory, increase the number of ‘directions’ to 
4 or more, by proper convention as to the order in which 
the elements are to be taken. On the other hand, a single 
quantity—what in the language of vectors is called a scalar 
—may be regarded as a set not extended in any direction. 

The tensor calculus, using the word ‘tensor’ in its broad 
sense, deals with all these different kinds of sets, in relation 
to sets of variables by which we can regard axes of refer- 
ence as being determined. In the narrower sense in which 
the word is used in reference to the theory of relativity, 
only sets which satisfy certain conditions are called tensors. 

In this book I have treated the tensor calculus as arising 
out of the use of determinants. Chapters I-IV deal with 
the elementary theory of determinants, so far as it is 
required for our purpose. (The student who is familiar 
with determinants can skip these chapters.) In Chapter V 
the tensor notation is introduced in successive steps, with 
explanatory remarks. These latter are in small print, not 
because they are less important, but in order not to break 
the continuity of the chapter as a whole. In Chapter VI 
these explanatory paragraphs (or parts of them) are brought 
together and amplified so as to give a general idea of the 
elementary properties of sets. Chapters VII and VIII 
deal with some developments of the subject in its 
general aspect. Chapter IX shows the application of the 
methods to certain problems in the theory of statistics and 
of error; this can be omitted hy any one who wishes to 
pass on to Chapter X, which deals very briefly with the 
tensor in its more limited sense, as applied to the theory 
of relativity. 


DETERMINANTS 


I. ORIGIN OF DETERMINANTS 


I, 1. Solution of simultaneous equations.—Deter- 
minants ordinarily arise out of the solution of simultaneous 
Suppose we have two equations 

5a+2y= 19 

4x+3y = 18 : 

Then, if we used only elementary methods, we could 
multiply the first by 3 and the second by 2, which would 


equations. 


give 
15%+6y = 57). 
8u+6y = 36 ; 
and thence, by subtracting, we should have 
lise 


whence either equation would give 
¥ = 2. 
Similarly, if we had three equations 
2@+5y+3z= 4 
e—3y—2z2= —l1}, 
—5¢4@-—4y+ z= 7 
we could, by eliminating z between the first and the 
second and between the second and the third, obtain 
ta@+y= § : 
—9x2—1ly = 13)’ 
whence, proceeding as before, we should obtain 
@=1, y= —2, 2= 4. 
This process of successive elimination is tedious, especially 
when there are more than two unknowns; and it is found 
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bettec to obtain a formula for the general solution and 
apply it to the numerical values of the particular case. 

Since such an equation as w—3y—2z = —1 can be 
written in the form (+1)#+(—3)y+(—2)z = (—1), we 
can use positive signs throughout, it being understood that 
the quantity represented by any symbol may be either 
positive or negative. 


I. 2. Formula for solution.—(i) For completeness we 
begin with one unknown. The equation 
OT ae ky 
gives 


(ii) For two unknowns the equations may be written 
ap hyy =%,) 
yx + boy = ky 
Multiplying the first equation by 6, and the second by 4,, 
. and subtracting, we get 


(410, —4b,)@ = hy b,—h,b,, 
whence 
e kyb,—kyb, 


av . 
ay by _ as by 


Similarly, interchanging a’s and 0’s, 


ee hy dy— hay 
b,a,—b, a, 
= ak, —a,k, 


a,b, _ Ayby 


It should be noticed that the expressions for # and for y 
have the same denominator, and that the numerators are 
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obtained from the denominator by replacing the a’s in the 
one case, and the @’s in the other, by #’s. 
(iii) Next take the case of three unknowns. Let the 
equations be 
aethy+aqc= a] 
A,U+b,y+ Cot = ky}. 
a,t +64 +¢,2 = ke 
Eliminating z from the first two equations, we obtain 
(446g qq) @ + (1p —bn03) y = hy Cg — ht. 
Similarly from the second and third equations 
(4203 — 450) @ + (420; —b5¢2)y = hac, — hye. 
Then, eliminating y from these equations, we get 
ee (44.0 — fa Cy) (4203 — 4500) — (hols — 300) (21 Cg — 4901) ; 
(4 C= 4g0y) (0903 — Fs €0) — (€q¢3 — 420) (44 Cg — 90%) 
As before, the numerator is got from the denominator 
by replacing a’s by 4’s, and we therefore need only consider 
the denominator. Multiplying out, it becomes 


AD Cy Cy — Ay 0503 — Aghy Cy Cs + Ag ba0y Cy — W194) Cy lg + Ag hal Cy 
+ 0,0, 02 —A,b501 C5 
C0 (Uy Ie C3 M Iig ly = Fg 0y C3 $ Ay h0y + 344g — 415490). 
Hence, replacing the a’s by 4’s for the numerator, 
oe bts ite — hint 5 + hig bgt + fishy Cy—highey 
Ay OC, — Ay Do.Cy — Ag D1 Cy + Ao b0y + Aa b1 Cy — Ag b4¢y 


Corresponding expressions can be obtained for y and for z. 


I. 3. General problem.—(i) We might proceed in the 
same way for equations in four or more unknowns. But 
this would mean that each case would have to be considered 
separately ; and not only should we fail to get a general 
formula, but the algebraical work would soon become 
practically impossible. We therefore alter our tactics. 
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We write down the general equations involving m un- 
knowns @, ¥, 2,...W 
a,a+by + etn tfhv=h . 
Ag+ UY + Coetur tfov = Up 5 (EE S28) 


in ® + Om Y + me +000 thm! = km 
guess at a solution, and then verify that this solution does 
actually satisfy the equations. 

(ii) The values of xz, y, z...w as found from these equa- 
tions will be in the form of fractions. We will consider 
first the denominators. Putting together the results ob- 
tained in § 2, for the cases of m = 1, 2, 3, we find that the 
successive denominators, which we will eall D@), D®, D), 
are 

DOVE, 

D®) = a,b,—a,), 


DO) = a b.,03 = 0b 5Cy— Ay), C3 + Og h3cy +4504 Co | 


(1324) 


— boc, 
We want to obtain an expression D\™), which we should 
guess to be the common denominator in the solution of 
the equations (I. 3. A), and of which D®, D®, D® shall 
be the particular cases for m= 1, 2, 3. 

(iii) The three D’s in (I. 3. 1) have a general similarity, 
which enables us to obtain a formula for D™. It will be 
seen that both in D®) and in D®) some of the terms have 
sign + and some have sign —. We will see first how 
the terms are constructed, and then consider the question 
of sign. 


I. 4. Construction of terms.—(i) For each of the three 
D's, for which the values of wm are 1, 2, 3 respectively, each 
term is the product of m factors, which are the coefficients 
in the equations in§ 2. In writing down these coefficients, 
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it is convenient to keep them in the relative positions in 
which they occur in the equations. Thus we get 


For D®) For D®) For D®) 
ay a, 0, a, 6, ¢, 
Ges CE at 
a, b; Cg 


In each case we have a set of quantities arranged in the 
form of a square. The individual quantities are called 
the elements of the set; the quantities in a vertical line 
constitute a column, and the quantities in a horizontal 
line constitute a row. The columns are numbered from 
the left, and the rows from the top. The diagonal drawn 
from the top left-hand corner—i.e. the diagonal through 
a,—is called the leading diagonal. 

(ii) Each term contains m factors, which are taken from 
the set in such a way that one (only) shall come from each 
column and that one (only) shall come from each row. Also 
DX) contains every term which can be constructed in this 
way. Take, for example, D@), Since one factor is to come 
from each column, the factors are ang,a6,andac. The 
a can be either a, or a, or a, 1.e. it can be taken in three 
ways; when one of these three a’s has been taken, one row 
has been used up, and the 4 can only be taken in two ways; 
and, when one of the two @’s has been taken, the e can 
similarly only be taken in one way. There are therefore 
3.2.1=6 possible combinations of factors; and this is 
the number of terms in D®), 

(iii) Another way of stating the thing is that, if we 
keep to a fixed order a 4c of the factors in each term, 
the suffixes of the factors are the numbers 1 or 1 2 or 1 2 3, 
arranged in different ways, and there is one term for each 
of the possible arrangements. 
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(iv) We conclude that D™ contains terms, each of which 
is constructed by taking m factors from the set of m x m= m? 


quantities 
ay b, Cj. ere 
ae be ea voto 


am ine Cm Ses Tin 
in such a way that there shall be one factor (only) taken 
from each column and one (only) from each row; there 
being one term for each of the m (m—1)...1=m! ways 
in which this can be done. Or, which comes to the same 
thing, that the terms are made up of factors adc... f with 
suffixes 12 3... arranged in different ways, there being 
one term for each of the wm! possible different arrangements. 


I. 5. Rule of signs.—(i) It will be seen that, in the D’s 
after D“), half of the terms are positive and half negative, 
and that in each case the term found from the elements in 
the leading diagonal—namely «@, or a1, or @,/,¢;—is posi- 
tive. We should therefore expect that half of the terms 
in D(™) would be positive and half negative, the term 
dy by C,.-+f;,j—which we call the leading term and usually 
place first—being positive. The difficult question is that 
of signs. In the case of (say) 1 = 5, how are we to know 
whether such a term as a,/,¢,7,¢, is to have the sign + 
or —? 


(ii) The sign of a term must, if the letters a Jc... fare 
kept in their original order, depend on the arrangement of 
the suffixes, i.e. on the extent to which this has departed 
from the initial arrangement 1 23...%. Now any arrange- 
ment such as 35241 can be got from the initial arrangement 
12345 by a series of interchanges of adjacent figures. We 
must fix a definite order in which these interchanges are to 
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be made. We therefore say that each figure is to be moved in 
turn, beginning with that which ultimately comes first, then 
that which ultimately comes second, and so on. Thus in this 
particular case the successive stages would (repeating for each 
group of interchanges the arrangement from which we start) 
be 12345, 13245, 31245; 1245, 1254, 1524, 5124; 124, 214; 
14, 41; 1: a total of seven interchanges. Now let us look 
at the signs in JD!) and D®), In D@) the arrangement 21 
is obtained from 12 by 1 interchange, and the sign for 21 
is —. In D®) the signs of the successive terms, and the 
stages by which the final arrangements of suffixes are 
obtained, are as follows, the numbers of interchanges being 
added in heavy type: 


25)? 0 ane tas ee loc aol: eae 2 
emg 2 132-0. 4 me Ya Bs is oe aan ae 2 
mm 2h 2 Le. su. =. | 123; 132,,912.321.... 3 


It will be seen that both for J@) and for D©) the sign is 
— or + according as the number of interchanges is odd 
or even. We therefore adopt this as our rule; in the case 
of a,6,c,d,¢,, for instance, seven interchanges are necessary, 
and the sign is therefore —. 


(iii) In order to find the sign of any given term by 
the above rule, it would be necessary to perform all 
the interchanges. A shorter method is to look at the 
term as it stands and to consider the reversals of order in 
it; i.e. taking the suffixes of the term in pairs in every 
possible way without altering their order, to see in how 
many cases the numbers are in the reverse of their order in 
the leading term, i.e. are in descending instead of ascending 
order, The term a,4,c,d,¢,, for instance, gives the following 
pairs, those in which the order is reversed being printed 
in heavier type:—35, 32, 34, 81, 52, 54, 51, 24, 21, 41. 


2611 Cc 
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It is easily seen that each interchange, of the kind described 
in (ii) above, produces one reversal of order; for, while we 
are shifting one number, such as the 5 in the second group 
of arrangements there shown, the relative order of the 
other numbers remains unaltered. It follows that the 
number of reversals of order is the same as the number 
of interchanges of this kind; and therefore the sign of 
a term will be — or + according as the number of reversals 
of order is odd or even. 


(iv) Zhe interchange of any two suffixes in a term changes 
the sign of the term. 


[Let the two suffixes be ¢ and W; ¢ coming before y in the 
term in question, but not necessarily being before it in numerical 
order. 

(1) First let @ and w be adjacent. Then the interchange of 
op and yf increases or decreases the number of reversals of order 
by 1, and therefore changes the sign of the term. 

(2) Next suppose that there are x suffixes between ¢ and w. 
Then we can move vy in front of ¢ by « + 1 interchanges with 
the adjacent term, and then move ¢ into the original position 
of y by x interchanges. This is a total of 2 + 1 interchanges, 
each of which in succession makes a change of sign: the total 
result is to change the sign of the term.] 


(v) We have so far assumed that the factors of a term 
are arranged in the original order of the letters a ded.... 
Now suppose that the order of the factors is altered in any 
way. How does this affect the rule of signs ? 

The alteration of order can be brought about by a series 
of interchanges of factors. Suppose there is an interchange 
of a, and 4,. Then, by (iv), the number of reversals of 
order of suffixes is altered by an odd number, but the 
number of reversals of order of letters is also altered by 
an odd number; and therefore, if we consider the sum 
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of the numbers of reversals of order of letters and of 
suffixes, this sum either is not altered or is altered by an 
even number. It follows that, if the factors of a term have 
been shifted about so that the letters abe... are not in their 
original order, the sign of the term depends on the sum of the 
numbers of reversals of order of letters and of suffixes re- 
spectively, being — or + according as this sum is odd or even. 
For example, in d,4,;¢,a,¢, there are five reversals of order 
of the letters and eight of the suffixes, so that the sign is —. 
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IJ. 1. Definition of determinant.—We can now com- 
bine the results obtained in I. 4 and I.5. We suppose that 
we are dealing with a set of m xm =m? quantities, which 
we can arrange in the form of a square, thus (the quantities 
being denoted by crosses) : 

Pe 2S 
Ke OM Kec oS 
Then the expression which we have to consider is the 
algebraical sum of a number of terms, of which some are 
taken positively and some negatively. Each term (apart 
from sign) is the product of m elements of the set, taken 
in such a way that one element (only) shall come from each 
column and that one element (only) shall come from 
each row; and there are m! terms, corresponding to the 
m different ways in which this can be done. The leading 
term is the term containing the elements in the leading 
diagonal of the square, and has sign +. The signs of the 
other terms are to be found by replacing the elements of 


the set by a, a,... 4,, ete., arranged as a hey set: 
mH Wy ef 
a,b, Cy wey 


an Pin Cm BS on 
The sign of a term is then — or + according as the sum of 
the numbers of reversals of order of the letters and of the 
suffixes, as compared with the leading term a, Jy¢y . «+ fms 
is odd or even. 
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The algebraical sum of the terms so obtained, namely 
Ob 903 +0» fin —Unby C3.» fn + ete., is called the determinant 
of the set, and will be denoted by D. It should however 
be observed that it is the determinant of the set as so 
arranged ; with different arrangements of the elements of 
a set, still keeping them in a square, we may obtain 
different determinants. 

We can therefore define the determinant as the alye- 
braical sum of terms of the form On baly ++ where par... 
are the numbers 123... m arranged in some order, there 
being a term for each of the m! possible orders, and the sign 
prefixed to the term being + for the natural order 123...m 
and — or + for other orders according as the number of 
reversals of natural order is odd or even. 

The symbol for the determinant is constructed by placing 
single vertical lines before and after the set; thus 


a, b Gy 
Wi 
a, Ob, Cg 


means the determinant @,/,c,— etc., which we have called 
De), 

The terminology is the same as is given in I. 4 and I. 5 
for a set. The quantities between the vertical lines are 
the elements of the determinant. Those in a vertical line 
are a column; those in a horizontal line are a row. The 
leading diagonal is the diagonal drawn from the top left- 
hand corner; and the leading term is that containing the 
elements through which the leading diagonal passes. The 
leading term, as already stated, is taken positively. 

If the symbol for a determinant contains m columns and 
m rows, the determinant is said to be of the mth order. 


II. 2. Elementary properties.—(i) From the mode of 
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construction it follows that each element of the determinant 
appears in (m—1)! out of the m! terms; and there are no 
two terms having more than m— 2 factors alike. 

(ii) If each element of a column or of a row is 0, the 
determinant is = 0. [For each term contains one of these 
elements as a factor. | 


II. 3. Properties depending on the rule of signs.— 
(i) The value of a determinant is not altered by making the columns 
rows and the rows columns ; e.g., for m= 4, 


a, 6, ¢ ad | = | a, @, a & |. 
Gn Upon, | by eG, 8, 
CPM Seat E Cy) ha Oe ee 
Gp ul ae a ihe day Gy 


[Let D be the determinant, and R the new determinant obtained 
by making columns rows and rows columns in the symbol for D. 
Then, apart from sign, D and R obviously have the same m! terms. 
We have therefore only to consider signs. ‘he two determinants 
have the same leading term, which is positive in both. Lett be 
any other term of D, say a,b,c,d,... . Then ¢ also occurs in R, 
but, since the terms of R must be constructed according to the 
system prescribed in our definition of a determinant, the factors 
of tin R will be arranged in the numerical order of the suffixes, 
namely d,c,a,b,... . The sign of ¢ in D depends on the number 
of reversals of order in the suffixes3421..., and the sign of 
¢ in R depends on the number of reversals of order in the letters 
dcab... . But each of these numbers is the sum of the numbers 
of reversals of order of letters and of suffixes as compared with the 
original orders abcd... and1234...; and, by I. 5 (v), these 
sums are either both odd or both even. It follows that the sign of 

is the same in both determinants. This is true for each term 
of D or R; and the two determinants are therefore equal.] 


If two determinants correspond so that the columns 


of one are the rows of the other, each determinant is said 
to be the transposed of the other 
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(ii) It follows from (i) that any statement as to columns 
or rows is equally true for rows or columns. We shall 
indicate this, for conciseness, by ‘column [row]’ or ‘row 
[column ]’. . 


(iii) Lf any two columns [rows] of a determinant are inter- 
changed, the absolute magnitude of the determinant remains 
unaltered, but its sign is changed. 


[Suppose, e. g., that we interchange the b’s and the e’s. Let ¢ 
and w be any two suffixes. Then, in the original determinant, 
corresponding to any term which contains by and ey, there is 
another term exactly similar except that the factors are by and eg 
and these two terms, by I. 5 (iv), are of opposite sign. The effect 
of interchanging the b’s and the e's is that the two terms are inter- 
changed, i.e. the sign of each is changed. This applies to every 
such pair of terms. } 

(iv) If any two columns [rows| of a determinant are iden- 
tical, the determinant is = 0. 

[We can see this in either of two ways. 

(1) Consider a pair of terms such as are mentioned in (iii). 
The one contains by and ey ; the other is exactly similar, except 
that it contains by and ey; and the two terms have opposite 
signs. If by = ey and by = ey, the two terms cancel. The whole 
determinant is made up of such pairs. 

(2) More briefly, suppose we interchange the two columns 
which are identical. Then the determinant remains unaltered. 
But, by (iii), its sign is changed. This can only be the case if 
the determinant is 0.] 

(v) Since columns and rows may be interchanged, a 
determinant is sometimes represented by its leading diago- 
nal alone, if this indicates a system for insertion of the 
remaining elements. The notation is 


| 1 ee5e-Sm |= ly 4 Gwe A [3 
2 8% Cy we Sy 
3% vs 


m ™m my voodin 
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it is then immaterial, so far as the value of the determinant 
is concerned, whether we enter the a’s as a column or 
as a row. But it should be mentioned that the relative 
arrangement of columns and rows is of importance later on, 
when we come to consider properties of sets of quantities. 


II. 4. Cofactors and minors.—(i) In the complete ex- 
pression for D, each term contains one a, which is either a, 
or a, or a, ete. We can therefore group the terms accord- 
ing to the a’s they contain. In the case of D©), for 
instance, 


Cee SON og OY 90g — Ay ly — Uy) Cg + Ay F0,0 + A391 Ce 
a, b, Cy — 43050, 
tix th © ‘ 


= 4; (by¢,— 802) + A (— 04 C5 +450) + 45 (44 ¢e 
—b,¢,). 
Suppose that the terms of D are grouped in this way; and 
let the resulting coefficients of a, a,a,... a, be denoted 
by 474 Ant ae pehen 
D = a, A, +4,A,+4,A, +... +mAm- 
Similarly, if we group the terms according to the 4’s or 
c’s ete. they contain and denote the coefficients of the 0’s 
ori c's ete. ‘by 2, By By. s4Bay ony G, Carn eee 
shall have 
D=b,B,+ 1, Bs +6,By+...+4m Bus 
D= eC, + Co0,+ 3 C3+ vet Co Ova 
ete. 


m 


The 4’s, B's, ete., are called the cofactors of the correspond- 
ing elements of the determinant ; thus the cofactor of 2, is 
B., where 6, B, is the sum of all the terms which contain 45. 


(11) The terms which contain the leading element a, are 
obtained from the leading term a,4,¢; ... fj, by altering 
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the suffixes, and prefixing the proper sign to the term, in 
the manner already described, with the proviso that the 
factor a, remains unaltered. But this process will give us 
the products, by @,, of the terms so constructed from 
a leading term 4,¢,...f/,,. In other words, the cofactor 
of a, is the determinant 


I eae 
b, Cs ts 
Orr. Cm eg tm 
In D®), for example, it is 
b20;—I3ly = | by Cy 
by 1; 


(ii1) The determinant which is obtained from D by 
striking out the column and the row which contain any 
element of the determinant is called the minor of that 
element in the determinant. 

We see from (ii) that 

A, =cofactor of a, = minor of a,. 

We might show in the same way that the cofactor of 
any other element, say c,,1s equal to the minor of that 
element, with the sign — or + prefixed according to the 
position of the element in the determinant: but it is 
simpler to find the cofactor by bringing the element into 
the position of a,. Let the element be in the gth column 
and the 7th row. Then we can make it the leading 
element, without altering the order of the other columns 
or rows, by means of g—1 interchanges of its column with 
an adjoining column and 7—1 interchanges of its row with 
an adjoining row. Hach of these interchanges, by II. 3 (11), 
multiplies the determinant by —1; and the total result is 
to multiply by -1 or by +1 according as g+7—2 is odd 
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or even. Having got the element into the position of the 
leading element, we strike out the first column and the first 
row ; the result, apart from the prefixed sign, is still to give 
the minor of the element, since the relative positions of the 
other columns and rows are unaltered. Hence the cofactor 
of any element is equal to its minor with the sign — or + pre- 
fixed according as the number of steps from the leading element 
to this element is odd or even; it being understood that each 
step is either horizontally from one column to the next or 
vertically from one row to the next. or example, 

A, = +minor of a,, 

C, = —minor of ¢,, 


(iv) We have found in (i) that 
a, A, ap a,A,+ a,A,+ a An Am = D | 
6, By +0, By Pb, Bet... 6, Be = BPs (ita) 
ete. 
We have now to find the value of such sums as 
a, By +a, Byt+ as Byt 6 +dmBrm, 
0,4, +b,4, +04, +... +b mA, 
Cy Ay +p dg + 0gd5 +e tC Ans 
ete. 

Let us take the second and third of these as examples, 
but replace the 2’s or the c’s by 6’s. Then we want to 
find the value of 

0, A, + 0,49 + O3Ag + .++ + Om Am: 
Now we see from (IJ. 4. 1) that this is the value of the 
determinant 
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for the cofactors of 6, 6,63... 6,, in this determinant are 
the same as the cofactors of a, a, a3...» in D, ie. are 
A, A, A,...A,. Let us replace 6 throughout this deter- 
minant by any letter, other than a, occurring in JD, e.g. 
by c. Then the determinant becomes 


ee Ore. ces, 
2 9, & on fy 
Cs ds Cs Ss 


Cm Oa Cm Sere: Va 
But this is a determinant which has two columns identi- 
eal, and its value, by § 3 (iv), is 0. Hence 
pay Gs AO, Ant cot Cy, =O | 
Similarly 6,4, +554) +1,d4y + +0. +0m4m = 01 
a, B, +4, By +4, By+... +A Bm =| 
ete. 


. (II. 4, 2) 


(v) Now let us interchange the columns with the rows, 
so that the determinant becomes 


Op On ov 


1 3 ™m 
0, Oy bg... Om 
Ce a ee 
Ai So Sa-- Sn 


Then, by § 3 (1), the value of the new determinant is the 
same as that of the old, i.e. is D. Also the minor of any 
element € in the new determinant is the same as its minor 
in the old determinant, but with columns and rows inter- 
changed, so that its value is unaltered; and the number of 
steps from a, to € is the same in both determinants. The 
cofactor of € in the new determinant is therefore equal to 
its cofactor in the old determinant. Hence by applying 
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(II. 4. 1) and (II. 4. 2) to the new determinant we get 


new sets of relations, namely 


a,4,+4, By +e,G,4+. 
4,4,+b,B,+¢,C,+.. 


ete. 
and 


a,4,+b,B,+¢,0,+.. 
ty A, +0, B,+0¢,C,+.. 


ete. 


ati = 
wot Ye Ne eet Wy) (II. 4. 3) 
tf, = 0 
a ee (II. 4. 4) 


(vi) If all the elements in a column [row], except one, 
are 0, the determinant is equal to the product of that one 


by its cofactor. 


III. SOLUTION OF SIMULTANEOUS 
EQUATIONS 


III. 1. Statement of previous results.—We have 
next to consider the solution of the simultaneous equations 
(I. 3. A). Before we do this, it will be convenient to 
express in determinant form the results obtained in I. 2. 
These results are as follows : 


jit ae=%,, then*® #=—|2,|=|2,). 


(2) If a,a+by =k, 
Dy ae 
then 
w=(|h bh |+|aq |), yH|q lel uy 4 
i by & bg a, a ae bg 
(3) If aathytoae = hy) 
eet hart bh 
a,0+6,Y + Cz = he 
then 
ee eo ee ie I, |) ae ey ey [ee DO), 
hy by Cy a, hy Cp 
hee 0s 85 iG yar 
=| a, b, ky + D@, 
| by | 
Dae fh, 
where 
DOV tae Oy atl; 
c Does 
a, 6, ¢, 


* Here, as elsewhere, vertical lines denote a determinant, not 
‘absolute value’. 
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III. 2. General solution.—The general equations of 
which we require a solution are those set out in (I. 3. A), 
namely : 

aat+ byt oqzt+..+ frw=h, 
a,t+ boy + Cet... t fom =he |. (IIT. 2. A) 


Ay @ + bm Y + me + ici Tye = horn 

The form of the solution is suggested by the results 
given above. Multiplying the successive equations by 
A,, Ay,... dm, and adding, we have 

(4,4, +4,4, +... +4mAm)® 
+ (6,4, +0,4,+ ... +0, Am)Y 
+ (¢, A, +g 4g 4... +6mAm) 2 
ar ooo 
+ (fA, t+ fede t e+ Sin Am) Y= hy Ay + hg 4gt oe thm Am: 
By (II. 4. 1) and (II. 4. 2) the coefficient of x is equal 
to D, and those of y, z,...w are equal to 0. Also the 
expression on the right-hand side is what D would become 
if we replaced the a’s by 4’s. Hence 
= (4,4, + hy Ag+ tee + km Am) ane: 


fa . ees leg, > r 
NE Oy. Cy heady ee eae ee eee 
he Op. Og ciate He ie eae 
kin Om Sm +++ Sm | Gin Om Cm »++Sm 
Similarly 
2 % ATT So 
Y=|% Ay eA [+ | yh yf, |; | 
el Ne Coney ie Woe Beet G 
| Am kin Cin Im an bm Cm vet 


and so on. 

If, for verification, we substitute these values in the 
original equations, it will be found that the relations 
(II. 4. 3) and (II. 4. 4) come into play. 


IV. PROPERTIES OF DETERMINANTS 
(continued) 


IV. 1. Sum of determinants.—If two determinants 
are identical except as regards one column [row], their sum 
is a similar determinant in which the elements of that 
column [row] are the sums of corresponding elements in 
the two determinants. [For example 


ee eee Py 0 (Ge | | yay Oe ec, 
ee, don On- ney Gotd, b, |, 
Ox C. oe Uae G44, 0, Cs 


since it is = (4,4, +4,4,+4,d,) + (4,4, +4, 4,4 4,4.) 
= (4, + 4,) Ay + (42+ dy) dy + (45+ ds) As. | 


IV. 2. Multiplication of determinant by a single 
factor.—If each element of a column [row] is multiplied 
by the same factor, the determinant is multiplied by that 
factor. [For example | 


da, 6, & | =Aa,A,+Aa,4,+A0,4,=A| a, 4, & |.] 
Rin i. Co Cp ie es 
da, bs C3 | as bs C3 


IV. 3. Alteration of column or row.—If the elements 
of a column [row] are multiplied by a single factor and added 
to the corresponding elements of another column [row], the 
value of the determinant is not altered. [For example 

Gite, 0, ¢ =| a o & | +Ale, o |) =| ay & lel 
Agtrly Oy Cy @, Ups, Cor Oxi Ogee! Co | 


az +Xeg by Cs az b, Cy Cz bg Cs M3 0 Cz 
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IV. 4. Calculation of determinant.—(1) There are two 
main methods for calculation of a numerical determinant. 
(a) When m is small, we can use the formula 


D=G-A, +a, Ao... Oy, Ams 
repeating the process as often as may be necessary. Thus, if 


D=\|3 —-2 7 
5 ln 27), 
4 6 1 
oe ae iar DR ase 7 
FG) ety Pree | 1 —8 
= 3.19—5(—44)+4(—1) = 273. 


(0) When m is large, we can reduce the determinant 
to one of order m—1 by means of § 3 and II. 4 (vi). 
Applying this method to the above example, we could 
multiply the first row by § and subtract from the second, 
and also multiply it by $ and subtract from the third. 
To avoid fractions, we multiply D twice by 3. Then 


IY i (es few” OP 3 —2 7 
1b 93) 69) 10 13. .aa4 
12° 18° 3) iy “aes aed 
=3x/13 —44 | = 3(—32541144) = 3.819. 
26.95 
D = 273. 


(1) For algebraical determinants various devices have to 
be used. An important determinant is 


D=|aqm-1 gm-1 cm-l em-1 ca 
qin-2 Um-2 ci-2. ‘ Elu-2 jure 
SP Se er pee 
a b Cy ae Pe | 
il 1 1 Mel 1 | 


IV. 5 (i) Calculation ef determinant Bi] 


This is = 0 if a=0 or if a=c, ete. Hence it contains 
a—b, a—c,...a—e, a—f as factors. Similarly it con- 
tains b—c,...—e, 6—f; and so on. Looking to the 
leading term, it will be seen that there can be no other 
factors ; i.e. 


D =(a—b)(a—c)...(a—0) (a—f).(b—c)...(b—0) (—Sf)...(e—f). 


[Example.—Hence prove that 


pit-1 ct em fo ie D ; Py 

pir-2 clt-2 Leva gilt—2 7™ == (-)m BG) x coefficient of a’ in p (a), 
- $ (a) = (a—b) (a-e)...(a-e) (a—F)] 
Se i i 


IV. 5. Product of determinants.—(i) The product of 
a determinant of order m and a determinant of order 2 can 
be expressed as a determinant of order m+ % by placing 
the leading diagonals in line and filling in with 0’s, For 
example 


Cee ee, ea On Onc, OL me 
Spawn eo i Le Up ey 0 
Gy On C5 a, UG. O50 
Ome 020 rag ee, 
OD 50 0d.” ve. 


[For the only terms of this latter determinant which are 
not 0 are those for which the first three factors (collectively) 
are taken from the first three columns and rows and the 
next two factors (collectively) from the last two columns 
and rows; and in each such case the first three factors 
form a term of the first determinant and the next two 
factors form a term of the second determinant. Thus all 
2611 E 
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the terms of the product of the one expanded determinant 
by the other are accounted for, and there are no others. | 


(ii) We have now to show that the product of two 
determinants, each of order m, can be expressed as a 
determinant of order m. To obtain the general formula, 
it will be sufficient to take a particular case, e. g. 


— — | 
DEG bf 4, B=|a,, Bao is 
ty Oy Cy a By Ye 
az 63 Cz a, PBs Ys 


provided that in our reasoning we retain m as the order of 
each determinant. 

By means of the first sentence of (i) we can write down 
DE as a determinant of order 2m, i.e. 


U) i ee ee 
Gos Gs OD 
Gon Os) CO) 10 
00 0 am B, nH 
0 O60" a 
0 0 O Qs Bs Y3 


Two of the quarters of this determinant contain 0’s only ; 
and it will be seen, from the method of forming those 
terms of the determinant which do not contain 0 as a factor, 
that we can fill in either of these quarters imany way we 
like, provided we leave the other quarter alone. Also, in 
order to reduce the determinant from order 2m to order m, 
we ought to get m 1’s in the leading diagonal. We there- 
fore shift the last m columns to be the first m, and then 
replace the 0’s in the new leading diagonal by 1’s. The 
first process involves m* interchanges: we can avoid change 
of sign of the determinant, in the case where m is odd, by 
vhanging the signs of the first m rows (before inserting 
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the 1’s), whether m is even or odd. Then, inserting the 
1’s, we get 
DE=|1 0 0 —-a, -b, —e, 

ment OM ber 0) | ae: 
0 0 1 -a, —6, —-e, 
4 f yy 0 0 0 
a By 2 0 0 0 

SB, ys 0 0 0 


We now reduce each of the elements in the right-hand 
top quarter to 0 by means of $ 3; i.e. we add a, times 
the 1st column to the (m+1)th (in this case the 4th), thus 
getting 


i 2 50) *0 —b, —¢ |, 
OO 10) —4,,—0,, -— 6 

0 0 1 -a, —b, —¢, 
% Py yy 44 89 0 

ay By Yz 4a 0 0 

a, Bs Y3 “as 0 0 


then do the same with a, times the 2nd column and 
a, times the 3rd..., and then deal in the same way 
with the (m+2)th and (~+3)th... columns. We get 
finally 


m={ 1 0 0 0 0 0 
ot 0 0 0 0 
@ @ al 0 0 0 


a, By yy May +4,Rytsy, bya, 40,8, +bgy, Ca, +0,8, + C371 

az By Ye Aya, +,Bgt Ms 7q Oyagt+o8.+sg7_q Cy a9 + CoB+ Cs Yo 

dg Bs Ye agta,BgtUsys Dyag+O,8,t+bgyz Cyas+C2R5+Cs yz 
By the same reasoning as that employed at the beginning 
of this paragraph, we can replace each of the elements in 
the left-hand bottom quarter of the above determinant 
by 0. Hence, if the determinant formed by the elements 
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in the lower right-hand quarter of the above is called P, we 
have, by (i), 


DEE VY WOOK Ses 
Oca RO 
No cg chee 
1.e, 
4% b 4 | xi|ay BH 
A, by Cy ag By Yo |: 
a, b, Cy ag Bz Ys | 


10, +4,8, +037, bya1+0,8y+0gy, Ca, +C,8i4+¢yy1|- (IV.5.1) 
Ay Ag+ Ay By+Agy_ Dyag td ,Bot+bsyo Cy ag + C89 + C3 Yo 
14,4028, +0373 0, a3 +0.83 + Dez C43 + C283 + C3 Vs 


The reasoning is quite general, and the product of two 
determinants of any order can be written down from the 
above. 

By interchanging columns and rows in one or other 
or both of the original determinants we get three other 
expressions. All four expressions, of course, are equal 
when expanded: we shall take the above to be the 
standard form, It is to be noticed that DE or DxF 
means the product of D and £, in this order; by reversing 
the order of multiplication we get four other forms, but 
these are only the ‘ transposed’ of the previous four. The 
eight forms are given, in the new notation (see V. 6 (vi)), 
in the Appendix (p. 122). They are based on the principle 
that in the standard form the element in the gth column 
and 7th row of the product is formed in a particular way 
from the gth column of the first determinant and the 7th 
row of the second. 


IV. 6. The adjoint determinant.—(i) Let D’ denote 
the determinant whose elements are the cofactors of the 
corresponding elements in the original determinant, i.e. 


IV.6 (ii) The adjoint determinant 37. 
PSAs By MG ky |: (IV. 6. A) 

ap Benoa. Fr 
F 


A B Cinase nt 

If we interchange columns and rows in J’, and then 
express the product DD’ as in (IV. 5. 1), we find that 
the elements in the principal diagonal of the product are 
A, A, + 4, Ag +06. Ams 0,B, +0,By +... +4 Bm; ete., each 
of which, by (II. 4.1), is =D, while the other elements 
are a, B, + a, B+... +4, Bm, 0,4, + bq Ao t+ 060 + Om Am ete, 
each of which, by (II. 4. 2),is=0. Hence 


Wa 0802 0, tae (IV 6.1) 


ne nev 


® 12) Wooo 
CLL OOD 
and therefore 
D = pr-i, (IV. 6. 2) 


The determinant L’ was formerly called the reciprocal, but 
is now more usually called the adjoint, of D. It is not 
the true reciprocal of D, since the product of the two is 
not 1 but D”™ (cf. V. 2). 


(ii) Let the cofactors of 4,,B,,C,,... /, in D’ be denoted 
by a,,8,,¥15+-5G- Then, applying (II. 4. 3) and (II. 4. 4) 
to D’, we have 

4,0, + Bi y+ Qy+..+ HG = DY 
A,0, + BoB, + Cy, ++ + L2G Fe 0 


Ayy 0+ BB; + Ont +++ Fin G = 0 
We can regard these as equations for determining a,,/3,, 
¥1,-+-G» Comparing them with 
Aya, + Bib,+C,¢4+..+ ff =D 
Ad, + By + C,¢,+..+hf, = 0 


Apt, + By by + Om ey + +0: + Fn fy = 0 
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which are obtained from (II. 4. 3) and (II. 4. 4), we see that 
the solution is 
D yf 
=> = UUs Sal Bienes Ar 7 = D2) ee. 

Hence the cofactor of any element of the adjoint determinant 
as equal to the corresponding element of the original determinant, 
multiplied by the ratio of the adjoint determinant to the original 
determinant. 


V. THE TENSOR NOTATION * 


7.1. Main properties of determinant.—(i) The 
notation so far used is the ordinary one for elementary work. 
For higher work we reduce the number of letters and make 
a more liberal use of suffixes. We shall replace 2, y, 2... w 
by X,, xX, y X33 000 Xs Ay Oy Cy oe Ai by dy, oy) Tsyy 00 Any 3 
and so on. Also, it being understood that the aos 
assignable to each of the letters g andr are 1, 2, 3,...” 
we can use |, r| and |d,9| to denote the ee ee 
whose oenes in the gth column and rth row are respec- 
tively ¢ 7, and d@,4, i.e. 


|For |=| Gr tI. + Gn 

dent ton dey ; ine 

Dry, Com  Egm+++ Eine Buu Vb A) 
| dng [=| 41 Ge yg ++ Ein 

dy dog pry em 

Any ding dng es Lan 


so that the interchangeability of columns and rows gives 


| dq | = | rq | (WV. 1.1) 


(ii) The #’s in (III. 2. A) were supposed to be known 


* The paragraphs in small print may be omitted on first reading, 
but should be read before Chapter VI is taken. 

+ It is more usual to have the suffixes the other way round; i.e. 
to use 11, 12, 13,...1m as suffixes for the first row. But my 
arrangement seems to follow more naturally from the a,b, 
notation, and I also find that it fits in better with the subsequent 
work. 
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quantities, so that the equations served to determine 
2%, Y;2,...w. We shall have to consider the relations between 
the set of quantities which we have denoted by 2, y, ,... 
and those which we have denoted by 4, 4, 43,-+»4m- We 
therefore, in altering 2, y, z,...w to X,, Xj, Xs)... Xm, also 
alter &), Han hey eee Mg BO al gs) gt gate ed ae 


(iii) The main properties which we have to consider are 
set out below. Definitions are marked with capital letters: 
propositions with arabic numbers. 


DE \ dg |=| Gy da Ia + Gam (A) 
Dye gg gg +1 Ins 
} dim dom sin Caan 
D,,, = cofactor of @,, in LD: ca resy aa? ORE 
ea? EA GR een D,,,d 
ae == aes a cee pl lat pega t + + pm qir 
Dif g i (1) 
i Oifg#p 
D' = | Do, ip Bs (C) 
QD a Ps Vs ee eee 
cofactor of Dos in D! = TIO de <i Se 
Tf dy, X, + dg Xo +g Agt 0. + dmg Am = Vy 
Oya Xy + dag Xo + deg Xy + e+ Imam = Vo 
dim x, oa dom Xe + d. 3m A, + td m X, = y,,| 
then (4) 
(pide eye) 
Bs, 1 x D, ot + Dy ats 37 he a Pym kg mm 
ete: D ; 


(iv) As a preliminary, to make the statements more 
concise, we may—though this is not essential—introduce 
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the ordinary notation of summation. Thus (1) can be 
written 
aS Once SS aman 
ie ude Pay Cb a Se) i) 
and (4) can be written 
Bites LS...) ~ dng Xp eon 
p= (4) 
2. D 


Meat i; 2k, = nae 
s=1 
(v) The tensor notation involves five steps, which are 
set out in §$ 2-4, 6, 8 below. The reader will find it 
helpful to copy out the statement in (iii), modified as in 
(iv), and make the successive alterations which are now to 


be described. 


V. 2. Reciprocal determinant.—The first step (which 
will be found in modern text-books) relates to (1) and (2). 
The determinant D’ has sometimes been called the reci- 
procal of D. But, as has already been pointed out in 
IV. 6 (i), it is not a true reciprocal, since the product of 
the two determinants is not 1 but J”. Since, however, 
D’ contains m columns, we see that if we form a new 
determinant by dividing each element of it by D the pro- 
duct of this new determinant and J will be 1. We there- 
fore write (1) in the form 


mol Oca Ut eae ea) 
Cae. pe D cue Oifgxps’ ° Oe 
and form a new determinant D” defined by 
DZD Dae Skea (©) 


This new determinant. will be equal to D'/D™=1/D, so 
2611 F 


42 Reciprocal determinant Nee 


that the product of the two determinants will be 1; and 
on cofactor of D,,/D in the new determinant will be 

dy3/D =a, D". Hens the two determinants D and D” 
are so rolarad that their product is 1 and that the cofactor 
of any element of either determinant, divided by the 
determinant, is equal to the corresponding element of the 
other determinant. We can therefore call each determi- 
nant the reciprocal of the other. 


V. 3. Elements of reciprocal determinant.—The 

next step is to have a single symbol for 
(cofactor of /,,, in D) + D. 
We have already used J, for “the cofactor of d,,,; and it is 
inconvenient to introduce a new letter in place of d or D. 
We therefore denote the above expression by 
dP®, 

We accordingly, in our statement, replace (B), (1), (C), (2), 
(3), and the second line of (4), by 


db? = (cofactor of d,,in D)+D . . . (B) 
yen. eg ee 8 eee 

ae Fe ee ) >a “ee Yo its 0} ae 

LE | GEN aa en 

DD" =1 ae Sse eee 


d.,, = (cofactor of d?* in D'’) + br <a URRY 


(p = 1,2,...m)X,= > dPeY, . [2nd line of] (4) 
oe 
The parallelism of (B) and (3), and of the two lines of (4), 
should be noted. 


V. 4. Set-notation.—(i) The next step is to abbreviate 
(4), as altered. This contains two statements—a hypothesis 
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and a conclusion; they are similar in form, so that we 
need only consider the first one, namely 


(ii) The expression on the left-hand side of this state- 
ment has a definite value for each value of s; we can 
denote these values by 

Ly E, Ey... Bm): 
The statement then takes the form 
ie a eT We ie ie 
We cannot merely omit the ‘(s=1, 2,...m)’ from this, 
without leaving it doubtful whether we are speaking of 
some particular s or of each s. We get over the difficulty 
by omitting the ‘(s=1, 2,...m)’ and replacing the s 
in ‘#,=Y,’ by a Greek letter. The convention then is 
that a statement of the form 
dai 
means that J, is equal to Y, for each of the values of s, 
i.e. that 
LL=Y,, y= ¥,, By = V5,...Em = Vins 
it being understood that the values 1, 2, 3,...m which 
are to be given to a Greek letter have been settled before- 
hand and remain the same throughout our work. 


(ili) Applying this convention to the two statements 
in (4), it becomes : 
eee Votes Ky > Va 5 A) 
p= eal 
It is immaterial what Greek letter we use in either of 
these statements, provided the letter is the same on both 
sides. We could have used the same letter in the two 
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statements, but in the particular case it is better to have 
the letters different.* 

We have rather spoilt the symmetry of (4), but we will 
put this right in § 6. 


(iv) The statement in (1) is a statement as to the value 
of a certain expression for all values of y and all values 
of g; and, so far as the left-hand side is concerned, we 
could extend the above principle by using two Greek 
letters. But the right-hand side presents difficulties; and 
we must therefore leave this over for later consideration (§ 8). 


V. 5. Principles of set-notation.—It is desirable at this stage 
to consider the principles underlying the notation which we have 
just adopted. 


(i) Take first the case of a single set of m quantities or elements; 
i.e. an aggregate of m quantities which fall into a certain linear 
arrangement. We denote these by, say, 

A AD Ag scu Lyn 
We have settled that a statement such as 

Ay = Ey 
is a comprehensive way of saying that 
Ahi AN spe Act) beeen ta alee 
Thus we use 4, etc. when we are referring to a particular member 
of the set, and we use A, etc. when we are making a statement 
with regard to each member of the set in turn. We may also 
want to speak of the set. as a whole. It will be found that no 
confusion arises from using A, in this sense also. We can there- 
fore say that 
A, =(A, 4, A;...Am) } 

the brackets being used in order to show that we are considering 
the set as a whole. In this sense we might regard the statement 
A, = E) as meaning that the set A) as a whole is equal to the set 


* I have as far as possible used X, pn, p, ¢ in this chapter to corre- 
spond to p, g, 7, 8, reserving v for product-sums (§ 6). Later on it 
is better to have no fixed rule, beyond that laid down in § 5 (iii). 
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E) as a whole, this equality of the wholes implying equality of the 
parts. We shall have to take this step later (Chapter VI): for the 
present it will be sufficient to regard the statement A, = Fy as 
merely an abbreviated way of saying that A, = E,, A, = E,, etc. 


(ii) The interpretation of A, as the set as a whole recalls the 
idea of a vector as the resultant of a number of components. But 
the operations which we shall have to perform with single sets do 
not follow exactly the same laws as those which govern operations 
with vectors as ordinarily understood (see note to (vi) below), so that 
the analogy must not be pushed too far. 


(111) Next take the case of a double set of m? quantities, i.e. a set 
consisting of m single sets, each containing m elements. If we 
denote the elements of the gth single set by Fy, Foy, «+ Fons 
this single set as a whole can be called Fy, and the complete 
double set can be called F,,. We think of the elements as 
arranged in a square, the Solas of which are the single sets: by 
regrouping (e.g. F,, Fy, ...Fimr) we get the rows of the square. 
We have already, in § 1, adopted the convention that in dy, or dg 
the g represents the column and the 7 the row; and similarly we 
shall say that in F,, or F,,, etc. the first letter, according to alpha- 
betical order, means the column and the second the row. Hence 


Lr Fy, Fy, Fy On re DF ey Fy, Fy Fy3 v0 Fim 


iG Fy» Psp Je Fina | En Fn Fs +++ Bom | 
: : : 
Fim Fo oe Fy Fis ae Fron) 


the brackets being inserted, as before, in order to show that the 
set of quantities is in each case to be regarded as a whole. Then 
the statements 


=Gups Fyup= Apps 


mean respectively ee gr = Fgr: nee that Fy, = H,,, for every 
value of g taken with every value of r. 

A double set is symmetrical if it is not altered by interchanging 
columns and rows. 


(iv) A particular form of double set is obtained by multiplying 
together every element of one single set and every element of 
another single set (of the same number of elements). If these two 
sets are B, and C, (in this order), the representative element of 
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the resulting double set will be B, C,, so that the double set can 
be represented by B, C, or Cy By. This double set is called the 
product of the two single sets. It should be noted that we must 
not write it as ‘B,C,’ or as ‘B,C,’; partly because this would 
not define the particular arrangement of the elements of the 
double set, and partly because we shall presently have to give 


a special meaning to these latter expressions. 


(v) In addition to double sets and single sets, we have to use 
single quantities, such as a or k. Any such quantity is called 
ascalar. It need not be a constant: it may, as will be seen later, 
be a definite function of the elements of one or more sets. 


(vi) We shall for the present be dealing only with expressions 
which, interpreted according to the laws of ordinary algebra, are 
obtained from scalars, single sets, and double sets by addition, 
subtraction, and multiplication. The rule of interpretation is the 
same that we adopted in (iv) for B, C,: we replace the Greek 
letters App...bypqr... and take the total expression to be 
the set obtained by giving to each of the quantities pq7... each 
of the values 1 23...m. For example: 


(a) kA, means the single set whose elements are 
1A, kAgy...kAyn 3 
(b) 4, +b By, means the single set whose elements are * 
A, FOB, ADB Aah Bas 
(c) Ayp—-aF,G, means the double set whose element in 
the gth column and the rth row is 4y,—aF),@q. 
It is obvious that this system of interpretation is in accordance 
with the laws of ordinary algebra; for instance 
k(A,+B,) =kA,+khB, =kB,+kAy, 


PF) (Gp— Hp) = FP, G)—- FH, 
and so on. 


We are further restricted, in the case of expressions containing 


* Tt will be seen from (vi) (a) and (b) and from (iv) that single 
sets follow the same rule as ordinary vectors as regards multipli- 
cation by a scalar, addition, and subtraction, but not as regards 
multiplication together. 
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more than one term, to (1) scalar expressions, (2) single sets 
arising as sums (‘sum’ of course including ‘ difference’) of expres- 
sions which contain the same letter, e.g. aA, + DB) +..., (8) 
double sets arising as sums of expressions which contain the same 
pair of letters, e.g. aA,, + bB,C,. We do not therefore have to 
consider such an expression as A, + B,, which is really a double 
set, not a single set. 


(vii) The suffixes which we have so far attached to a symbol have 
usually, in accordance with the regular practice in algebra and 
with the ordinary meaning of the word, been placed below the 
line: the exception being the use of d?s to mean (cofactor of dy. 
in D)+D. This latter system of having upper suffixes as well as 
lower suffixes will sometimes be found convenient. We may, for 
instance, want to denote a single set by A*; and in that case 
A’, A®,... would be members of the set. Where there is any risk 
of confusion, we shall not use the ordinary indices of algebra at 
all; thus the square of A, will be A, Ap, not A,. 


V.6. Product-sum notation.—(i) Our next simplifica- 
tion consists in dropping the sign of summation in (1) 
and (4). But, since merely to drop it and to replace, say, 


> Wd,, by d%dy, would be misleading, we use a 
s=1 

special notation. The number of alphabets at our disposal 
is limited: and it will be found not only that we can use 
Greek letters for this purpose without risk of error, but 
that there are actual advantages in doing so. 


(ii) The rule we adopt is that, when an expression of the 
form B,C, has to be summed for the values 1, 2,...m 
of p, we denote the result by replacing p by a Greek letter 
in both places; and, conversely, the meaning of such an 
expression as B,C, is 


B C, = BC+ B,C, + ot Bin Cm: (Vv. 6. A) 


ans 
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(iii) The pair of v’s in B,C, could be replaced by a pair 
of any other identical Greek letters; e.g. 
B,C, = BC,. (V. 6.1) 
The v (or p) is for this reason called a dummy. We can 
think of the sum represented by B,C, as the result of link- 
ing the elements of B, with the corresponding elements 
of C,; we can therefore describe a Greek letter which 
oceurs twice as a linked suffix, and one which occurs once 
only as a free suffix. 


(iv) The rule in (ii) applies if either or both of the expres- 
sions B, and C,, has a free suffix as well as the p; eg. 
A, B,, means A, B,, + A, Bi + 0» + Am Bums and 4,, B,, 
means 4,, Boy + Ayo Boo +... + Axm Bom, Which is a double 


set whose typical element is 4), B,,. 


pi 


(v) We can also have successive summations expressed 
in the same way. Thus 


BQ DP hoy 


involves summations with regard to A, with regard to y, 
and with regard to p. It is easy to show that these sum- 
mations can be made in any order: e.g. we can take C,, 
and ty together as if the X and p were free, and then 
bring in B, and £,. 


(vi) As an example of the brevity effected by this 
notation we may take the expression for the product of 
determinants. Even for so small a value of m as 3, the 
expression obtained in IV. 5 (ii) for the product of two 
determinants is formidable. We can condense it by intro- 
ducing S’s, but the result is clumsy, In the new notation 
it will be found that the method of IV. 5 (ii) gives 


| Cor | x | bor | = | Gor Dry |. (Vv. 6. 2) 
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The process can be repeated: e.g. 
| Lor | x | bor |x| Cor |x | don | = | B94 Cp Lyr |. (Vv. 6. 3) 
(vii) The result of applying the product-sum notation to 
the statement in § 1 (iii) is that (1) and (4) become (p and 
s in (4) being replaced by A and o respectively) 
P=1,2,...M3\ no, _ (lifg=p 
g= 41) 20 )a dg = 10 ema i) 
oe ae men Ae a gl. 4(4) 


Vy. 7. Inner products of sets.—(i) The quantity B,C, behaves 
in many respects like an algebraical product. We call it the inner 
product of B, and C,, to distinguish it from an ordinary or outer 
product such as B,C, (§5 (iv)). The inner product of B, and C, is 
the sum of the elements in the leading diagonal of the outer 
product of B, and C,. 


(ii) In the same way 4,B,, is the inner product of A, and 
Buy, and Ay, By, is the inner product of A), and B,,. 


(7ii) The process of forming an inner product, as above, may be 
called inner multiplication. 


V. 8. Unit-set notation.—(i) We have finally to con- 
sider the form of (1), which is a statement that 
Ye 8 lp Rae Ob errr =1) pied 
(e =—1, 2,...m )# boa Oifg#psS 
So far as the left-hand side is concerned, this is a statement 
as to the values of the elements of a double set 


dd... 
As regards the right-hand side, however, the statement falls 
into two ; first that d?°d,,, = 1, and next that dP* dog = 0 if 
p and gare different. We want to replace these by a single 
statement. 


(ii) We do this by converting the statement into one as 
2611 G 
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to the equality of two double sets. For this purpose we 
construct a set whose typical element, in the gth column 
and 7th row, is 1 if g and 7 are the same and 0 if they are 
different ; a set, in other words, which has 1 for each 
element of its leading diagonal and 0 everywhere else. If 
we call this set * 

[ns 
then our definition of |/ is that 
the function of p and g which 


aye ings 
es ee (V. 8. A) 


We can therefore write (1) in the form 


jo = tly yonedne mG wd, 
G sar b Zonas 
or, in the set-notation, 


De dhe oe tam Ce aes 


[= 


V. 9. Properties of the unit set.—(i) We have defined |* as 
the set whose typical element is 
Poe anee (V. 9. A) 
2 CONig ep 
Hence each element of the leading diagonal of e is 1, and the 
other elements are all 0; in other words 


[;=(1 0 0 0...0). (V. 9. B) 
i LO e Vasc? 
001 at 

0 


\j 


* The usual symbol, adopted by Einstein, is 52; J. E. Wright 
(‘Invariants of quadratic differential forms’) uses n,,,. Neither of 
these seems sufliciently distinctive ; and 6 already has a consider- 
able number of other uses. I have therefore altered the symbol to 
|* (Cunit \’), as an experiment. 


0 sOna 


V. 10 (i) Properties of the unit set 51 


This set (with any pair of letters) will be called the unit set. The 
following are its chief properties. 
(ii) The set is symmetrical, i. e. 

y= |k. (V. 9.1) 

(111) The determinant of the set is 

HZj=]1 0 0.2 

ORO 

00! 2... 

a ak 

(iv) Also, if we multiply together this determinant and any other 


determinant, in either order, it will be found that we merely 
reproduce the latter, i.e. 


HF] x | eqn | =| @gr | x [LF |= lage |- (V. 9. 8) 


=1, (V. 9. 2) 


[—o) oO (=) 


(v) The special importance of the set, or of any column or row 
of it, les in its effect when combined with another set to form 
a product-sum. It will be found that, ¢ having any one of the 
values 1, 2, 3,...m, 

pAp=[f4p=45 |i 4o=|%4—4n, (V-9. 4) 
4p =|, 4e=40, [24 =| 5 4p =~ (V- 9. 5) 
[For example, take ¢ = 3 in the first part of (V. 9.4). Then 
[ $4, =124:+]34.+134, +[24,+... 
= 0) -Ay tO, A An 0. Agee 
as A,.] 
Thus the effect of inner multiplication by |} of a single or double 
set which contains p (or A) is to alter the latter to A (or p). 


V. 10. Determinant properties.—(i) Before we write 
down the final results, there is another small change which 
we shall find it convenient to make. In the statement 
C20 = Ve obtained in § 8 (11), the linked suffixes are an 


v7 
upper o and a lower o, which cancel one another; and the 


52 Determinant properties V. 10 (i) 


free suffixes \ and y are in the same respective positions 
on the two sides. It is desirable that, whenever possible, 
these two conditions should exist. In each of the state- 
ments d,, X, = Y, and X, =a’ ¥,, given at the end of §6, 
one of the conditions exists but the other does not. We 
make them both exist by replacing X,, throughout, by 


SS (X1 bb) Gy 
as explained in § 5 (vil). 


(ii) Our statement, after carrying out the alterations 
indicated in §$ 2—4, 6 and 8, and in (1) above, becomes— 


Notation. DS | Gigs (V.10. A) 
db* = (cofactor of d,, in D)+D. (V.10. B) 

DY = | da |. (V.10. C) 

Properties. dd. a [}: gre: eae 8 
DUS 1. (V. 10. 2) 


dys = (cofactor of d?* in D") + DO TAGS) 
If de 5 dy, X", then X* = a” tr (V. 10. 4) 


To these we may add the formula (V. 6. 2) for multiplica- 
tion of determinants, namely 


| Gor | x | bor | = | CANO Is (Vv. 10. 5) 


V.11, Example of method.—To illustrate the methods 
that we are now able to use, let us verify (V. 10. 4) by 
means of (V. 10.1). It is given that 

Yi teth des 
To find the value of @’ Y,, it will not do to replace Y, by 
the above value as it stands, since we should then have 
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three \’s. We must first replace \ in the expression for 
J, by some other suffix, say yp. We then have, by (V. 10. 1) 
and (V. 9. 5), 

G°Y = dd, X" mo) et ni 
which is what we wanted to prove. The reader will find 
it instructive, for comparison, to write out the proof in the 
ordinary notation. 

In the above proof we have proceeded from d@*’ (d,,, X") 
to (d*’ d ug) A’. It has already been pointed out, in $6 (v), 
that summations in a case of this kind can be made in any 
order, 


SETS 


VI. SETS OF QUANTITIES 


VI. 1. Introductory.—(i) It may have been noticed that 
in V. 11,in deducing (V. 10. 4) from (V. 10. 1) and (V. 9. 5), 
we made no direct use of determinant properties: the only 
indirect use being in the relations between elements and 
their cofactors, from which (V. 10.1) was derived. But 
we can dispense even with this indirect use. In the equa- 
tions d,,X*=Y, the values of d,, are supposed to be 
known; and we can treat the statement in (V. 10. 1), 
namely 

dig = [n. 
as a set of equations giving the values of d*’ in terms of 
those of d,,. If, for instance, m = 20, the set d,, contains 
400 elements, and (V. 10.1) is a condensed statement of 
the 400 equations (each with 20 terms on one side) which 
give the 400 values of d‘”. Thus for p=2 we should 
have 


Ay D** + day d*® + dog d?? +... + dom d?™ = 1 


dy, + dy, d? + d,,4 +... 4+ dyy,d™ = 0 
Ay 07! + digg J? + dag O? + 04. + dau, 02 = 0 P 


day B® + Cy B® + digg 4? + 000+ Ui: B™ = 0 | 


which give the values of d?!, d?2, d?3.,.d?™, i.e. of d?’. 
Similarly for d!’, d*’, ete. 


(ii) We have, in fact, arrived at a position similar to 
that reached at the end of the first chapter. We started 
with the problem of solving a set of simultaneous equations, 
and arrived at a probable solution, involving what we 

2611 H 
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called determinants. To verify the solution, we had to 
investigate the properties of determinants. The determi- 
nant thus took the leading place, its applicability to the 
solving of equations being one only of its properties. 
A determinant of order m is based on a set of m” quantities, 
which for convenience of reference are thought of as 
arranged in a square, the determinant being expressed by 
enclosing the set of symbols of the quantities between 
vertical lines: and we have reached the stage at which the 
set of quantities becomes the important thing, its existence 
as the basis of a determinant being one only of its 
properties. 


(iii) These properties we have now to consider. The 
following sections of this chapter are mainly a restatement, 
with obvious modifications and extensions, of results ob- 
tained in the preceding chapter, 


VI. 2. Single sets.—(i) We may have a single set 
A, =(4, 4, 4;...4). 
The separate quantities 4, 4,...4,, comprised in the set 
are called its elements. The order of the set is the number 


of elements comprised in it. The typical statement with 
regard to such a set is of the form 


vi — E,. 
This, in the first instance, we regarded merely as a short 
way of saying that 
Ay eB oA ee ects 


mM 9 

but we must now think of it as a statement that the two 
sets A, and L, each taken as a whole, are equal, this 
equality of the wholes implying the equality of correspond- 
ing elements. The analogy of a vector may help us here. 
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The statement that two vectors are equal implies that the 
components are equal, each to each; but what we really 
think of is not the separate equalities of the components 
but the equality, in all respects, of the vectors. 


(ii) Single sets behave like ordinary vectors as regards 
addition and subtraction and multiplication by a scalar 
($ 4 (iii)), but not as regards multiplication of one single 
set by another. 


VI. 3. Double sets.—(i) We may have a double set of 
order m—1.e. ape m” elements— 
a =(4y Ay A, .. : Amy \. 
ae ee, | 
Ui. A, ne Aon Zao ee) 
Here the quantity in the gth column and rth row is 4,, 
We adopt the convention that the first Greek letter (in 
alphabetical order) represents the column and the second 
the row, so that 
Ay As Aj; ea \> 
Ay, Ag, Aggy «+ Aom | 
DO Na Bee 
Am Ame Ams + nee) 
the representative element of which is 4,,. The sets 4,,, 
and An, are called the transposed of each other. 


(ii) The determinant 
| Agr | 
is called the determinant of the set* 4,,, and similarly 
| 4, | is the determinant of 4,,,. 
* The set is usually called the matrix of the determinant, It is 
a singularly inappropriate name, as the symbol of the set is the 


inner part of that of the determinant, not something which sur- 
rounds it. The setis really the substance or core of the determinant. 


60 Double sets VI. 8 (iii) 


(iii) The brackets in which the elements of 4,, 4,,, 4,, 
have been placed are not essential,* and have been intro- 
duced partly to help the eye and partly to indicate that the 
sets are being considered as a whole. 


(iv) A double set is symmetrical if columns and rows can 
be interchanged without altering it. Thus, if 4,, is sym- 
metrical, then A,,=A ae and conversely. 


VI. 4. Sets generally.—(i) We describe a single set 
as being of rank} 1, and a double set as being of rank 2. 


(ii) Similarly a set of rank 3 of order m is made up of m 
double sets of order m; and soon. Thus we might repre- 
sent a set of rank 3 by 4,,,-. There would have to be 
a convention as to the order of the symbols, so as to dis- 
tinguish 4,,, from 4,,,, etc. Where, however, the set is 
symmetrical, so that 4,,,=4,,,=etc., this difficulty does 
not arise. 


(iii) The set of rank 0 isa single quantity or scalar. 


(iv) To denote a set generally, without reference to its 
suffixes, we use a Gothic letter such as 4 or 2B. 


* An alternative method, in the case of a double set or matrix, 
is to enclose the symbols between two pairs of vertical lines, so as 
to distinguish the set from the determinant, which has two single 
lines. It is not a satisfactory symbolism from our point of view, 
as it would seem to suggest that the set is more restricted than the 
determinant, whereas what we are aiming at is to free the set from 
the bonds of the determinant. 

t+ I have been doubtful as to the appropriate word. In reference 
to tensors Einstein uses Rang, Hilbert Ordnung, Eddington rank, 
de Sitter order. The objection to either of the last two is that 
there is already a settled meaning for order as regards a determinant, 
and (though this is not so important) for rank as regards a matrix 
(see note to § 8 (ii)). It would seem reasonable to describe a set 
containing m/ elements (i.e. composed of m sets each containing 
m~* elements) as being of degree f. I have, however, felt bound to 
keep to Eddington’s use of rank. 
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VI. 5. Sums and products of sets.—(i) If two or 
more sets, of the same rank and the same order, have the 
same suffixes, we add (or subtract) them by adding (or ils 
tracting) corresponding elements. Thus 


4,+ 8, =(4,+8, 4,4+B, A,+B;...dmt+Bry); 


mm 
and similarly for sets of higher rank. 
(ii) We multiply a set, of whatever rank, by a scalar 
when we multiply every element of the set by the scalar ; 


ates 


kA, =(hkAy kd kdy ...tAm) )- 
hk Ayo kAy» kAso kAing 


\ kAym kAom kAgm tne W Aaa 
Thus the determinant of kA,, is not & times, but 4” times, 
the determinant of 4,,, 

(iii) If A and 3B are two sets, with different suffixes, o 
ranks f and g respectively (fand g not being necessarily 
different), their product 4 26 is the set of rank f+g ob- 
tained by multiplying every element of one by every 
element of the other. (Here, as elsewhere, we assume that 
all the sets we are considering are of the same order.) 
Thus the product of two single sets 4, and B, is the double 
set 4,B, obtained by giving to p and o separately each of 
the values 1 to m. A product obtained in this way is 
sometimes called an outer product, to distinguish it from 
an ‘inner’ product as defined in § 6 below. 


VI. 6. Inner product.—(i) When a suffix occurs twice 
in an expression such as 4,, or B,C,, or, more generally, in 
any single expression or product, e.g. Ayyy,,,, OF DS ee ye os 
(where the letters may be in any order), this means that the 
expression is to be summed for the values 1, 2...m of the 
suffix ; e.g. 
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B,C, = B,C, + B,C, +...+BnGn 
BD, = By Dy + BoDyot «+ bia)" (VI. 6. A) 
ete. 

Where a letter occurs twice in this way, each of the two 
letters is linked. Where a letter occurs once only, it is 
free. The linked suffixes are called dummy, as they can be 
replaced by a pair of any other suffixes not already occurring 
in the expression. 


(ii) In the particular case where the expression to be 
summed is of the form B,C,, the result is called the inner 
product of B, and C,, or of B, and C,, ete. It is immaterial 
what suffixes we use in this latter description, since they 


have to be replaced by one and the same suffix. 


(iii) From a pair of double sets 4,,, and B,,,, or A,, and 
By, We can by a single product-summation form several 
different double sets 4,,, By yy Any Bs oe By, ete. There 
is also the scalar quantity a), Be formed by two summa- 
tions, which can be simultaneous or successive; if suc- 
cessive, the first is a product-summation giving us the 
double set 4,,B,, or 4,,B,,. Strictly speaking, this 
scalar quantity 4,, B,,, is the inner product of 4,, and 
B,,. But it occurs less frequently than the double sets 
obtained by a single summation, and it is therefore more 
convenient to call one of these latter the inner product. 
We shall call ie ge the complete inner product of 4,,, and 
B. By analogy with the expression found in V. 6 (vi) 
for the product of two determinants, we define the inner 
product* of two sets 4,,, and B,,,—or, more generally, of 

* This is what, in the case of matrices, is called the ‘ product ’. 
The true product of two sets A,,, and By, is a set of rank 4. 

If this use of ‘inner product’ seemed hkely to lead to confusion 


with the ‘complete inner product’, we could use a different 
phrase, such as ‘interproduct’. It should be noted that the inter- 
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two sets 4, and B, —, in each of which the letters are in 
their proper alphabetical order, as the set 4,, B,,; the 
linked suffix in this latter expression being chosen so as to 
be (alphabetically) intermediate between the two free suf- 
fixes. This is equivalent to saying, as regards this case, 
that the imuer product of two double sets is the double set 
whose element in the qth column and rth row is the inner 
product of the qth column of the first set and the rth row of 
the second set ; and we apply this rule to all cases. The 
simplest way of applying it is to use the result for 4 up 2nd 
B,, and alter the order of the suffixes where necessary. 
Suppose, for instance, that we want the inner product of 
oa, and Baus then by writing eas we see that the 
inner product is 4,, /,,=4,, B,,. It should be noticed 
that in all cases the inner product depends on the relative 
position of the original sets; thus the inner product of 


B,, and A, is not A,, B,, but B,, A,» 


(iv) There are four main forms of inner}product constructed in 
accordance with (iii); and four others, which are really repetitions, 
can be obtained by interchanging the two sets. Denoting the 
inner product of A and 4% (in this order) by & x 438f(cf. IV. 5 (1i) as 
to product of two determinants), the forms are as follows: 


P= Ay B,, 1) Buy 4yy—ByyAyp = Ay, By (9) 
AupX Boy =Ay Boy (2) Box Aup = BrpAvp = AvpByy (6) 
Apyx Buy =AyypByp (3) — Bupx App = Buy Apy = Ap Bw (7) 
eee By 4) Bag Age = By, Ap = Ap Bey (8) 


(v) The transposed of an inner product such as A, By, is found 
in the usual way (V. 5 (iii)) by interchanging the free suffixes p 
and p. By comparison of (1) with (8), (2) with (7), etc., it will be 
seen that the transposed of the inner product of two double sets is the 


mediate product-sum, for A,p)Byp, is not Ay, Byp but. either 
A,,B,, or Ay,By - There are various reasons for taking the 
former, rather than one of the two latter, as ‘the’ inner product. 
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inner product of the transposed sets, in reverse order; e.g. the trans- 
posed of the inner product of Ap, and Bp, is the inner product of 
B,, and Ay». 

(vi) It will be seen by comparison with IV. 5 (ii) and the 
Appendix that the rule for construction of the inner product of 
two double sets is exactly the same as that for construction of the 
product of two determinants; so that the determinant of the inner 
product of two double sets—whether we call them (say) A,, and 
Boy, or Ayo and B,,—is equal to the product of the determinants 
of the two sets. 


VI. 7. The unit set.—(i) The unit set* 
I, 
is defined as the set whose typical term is 


Pikes 


ee nee (VI. 7. A) 
so that 
ee(t 0 0 fer (VEST 3) 
Pie Ob tla 
> Oe cin On sO 
0 weno 


(ii) From the definition it follows that the set is sym- 
metrical, i.e. 


Poth, (VI. 7. 2) 
and that 
tales (VI. 7. 3) 


(iii) The special property of this set is that, if 4, is any 
set (possibly containing other suffixes o r...), then 

1:4,=1,4, = 4,, (VI. 7. 4) 

so that the inner product of the unit set and any other set 


* This is by analogy with the ‘unit matrix’. 
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is the same as the latter but with the suffix changed. In 
other words, the unit set acts, for inner multiplication, as 
a substitution-operator. 


(iv) In particular, the inner product of two unit sets is 
a unit set, i.e. 


hea (VI. 7.5) 


VI. 8. Inverse double sets.—(i) We take any double 
set 


Arp; 


and we say that there is another set 
APE 


connected with it by the condition that the inner product 
of the former and the latter is a unit set, i.e. (see §§ 6 (ii1) 
and 7 (i)) that 

A, A” = |f. (VieS51) 


This represents m? equations, which are sufficient to deter- 
mine the m? values of 4”! when those of 4 , are known. 
The set A, as defined by the above condition, is called the 
inverse of the set 4,,. We shall keep to this notation, so 
that (VI. 8. 1) will always hold, however we alter the 
letters A, pu, v, p. 


(11) The above is subject to one condition. If we write 
down the equations which determine the elements in, say, 
the second row of A“, namely 


4,,A" + 4,,A" + A,, 42 + ane + Aya = 0 

Ay, A” + Ao A” + Ay, A +... + Ay, A = 1 

A;, A”! +- Ay, A +- Ag, A? +... + Aggy A = 0}? 
Bet tnd tA gl... + Any A 0 


2611 iL 
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we see that in order that there may be a solution it is 
necessary that we should have | 4,, | # 0, which is the 


same thing as 
| dor | # 9- (VI. 8. 2) 


This applies also to the other rows. It is a sufficient as 
well as a necessary condition for the existence of AP. 


(iii) Taking it that | 4,,| # 0, we have, by (V. 6. 2) and 
(VI. 8. 1) and (VI. 7. 3), 
|40 |x | 4g |=l4% 4p =l$|= 1. (VE. 8.3) 
It follows that 
| Air | 0, | AT] #0. (VI. 8. 4) 


(iv) The statement (VI. 8.1) is a statement as to the 
m® relations obtained by taking each value of p with each 
value of p. It is therefore equally true to say, by inter- 
changing yp and p, that 

A™ A. = |b. (VI. 8. 5) 
The expression on the left-hand side is (§ 6 (iii)) the inner 
product of 4"? and 4,,; and these are the transposed of 
AK and 4, respectively. Hence, if 25 is the inverse 
of A, the transposed of @ is the inverse of the trans- 


posed of 3B. 


(v) The relation in (VI. 8.1) is a relation connecting 
columns of the original set and rows of the inverse set. 
There is a similar relation connecting rows and columns. 
For (VI. 8.1) gives 


rv ov ) iX AN rx 
AP A yA S| As a |S aG 
whence, as will be shown in § 9 (v), it follows that 
i A 
AM Ai =p (VI. 8. 6) 


and hence also 
IN A” = [es (VI. 8. 7) 
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The expression on the left-hand side of (VI. 8. 6) is the 
inner product of 4’ and 4,,, which is the equivalent 
of that of A’* and 4,,,3 so that, by the definition in (i), 
the latter is the inverse of the former. Hence, if 2% is 
the inverse of @, then @ is the inverse of 25. The 
relation (VI. 8.7) is similar to (VI. 8.5), and shows that 
the transposed of 35 is the inverse of the transposed of . 


VI. 9. Reciprocation.—(i) Suppose there are two single 
sets X* and Y, connected by the relation 
i= Ay, ae 
Then, as in V. 11, we have 
EVES TIS iy Seas) Pp a Ae 


ped =A eX, thene X74 ay as (Vin Ont) 
Similarly by taking X* to be each single set, in turn, 
of a set of second or higher rank, with Y, to correspond, 
we find that 


If Y,,,,,.=4,,X%,,,, then X2, = A’Y,,.... (VI.9.3) 


(ii) Thus the operation represented by 4,, is annulled 
by the operation 4*”; and conversely. The sets 4,, and 
A’ will be said to be reciprocal to one another: and the 
process adopted in (VI. 9. 1) and (VI. 9. 2)—which we shall 
have to use very frequently—will be called reciprocation. 


(ili) We see from § 8 that the reciprocal of a set is 
the transposed of the inverse of the set, and conversely. 
If a set is symmetrical, its inverse and its reciprocal are 
identical. 


(iv) As an example of the application of (VI. 9. 1), 


suppose that 
A,, X* = 0 
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for all values of c, Then, by reciprocation, 

X= 4’0= 0, 
provided that |4,,|40. (This is practically the same 
thing as saying that, if the inner products of X* by m in- 
dependent single sets are all 0, then X* is 0.) 


(v) Similarly, suppose that 
ALO =A 4 
identically, i.e. for all values of AX and o, and that 
| 4,,| is not = 0. Then, by reciprocation, 
GY = AY’ A,, Ge = le = pe. 

Thus we can divide both sides of the equation by 4,,. 
This supplies the missing step in § 8 (v). 

(vi) The two definitions, and the proposition, used in 
the establishment of (VI. 9. 1) are 


Liit=% 
Gus fl 
[i= ey. 2 Poe 
|, At = a", x Se oe ee 
Ay Ae eo eee 


and from these we deduce (VI. 9. 1). We could have 
altered the order in various ways. For instance, we 
could have defined eS by (1); thence, by giving A its 
successive values, and equating coefficients, we should have 
got (A). Also we might have defined d*’ by (VI. 9. 1), 
instead of by (B), as the coefficients of Y, when the equa- 
tions Y, = A,,X* are solved for X*. This would give 
X= AY A, oA“. Then, if we defined |. by (1), we should 
have 4.4, =|%; or, if we defined 1.7 by (B), we should 
have X* = Ts X*, which is (1). 


VI. 10, Continued inner products.—(i) We can construct con- 
tinued inner products without ambiguity, provided we adhere to 
the rule laid down in § 6 (iii). Suppose, for instance, that we 
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want the inner product of A,,, By z, and Cz,. That of Ay, and 
Bq, according to the rule, is A), Bug. If we call this Fy,, then, 
by (2) of § 6 (iv), the inner product of Fy, and Cz, is Fyy Coy; i.e. 
the inner product of Ay,, Byg, and Og) is Ay, Byy Coy. Similarly 
that of A),, Byg, Cg, and Dyg is Ayy Buy Cov Doo: 

(ii) On the other hand, the inner product of Az), Bz,, Cyg, Don 
is not Ago Boy CyuyDyr. For, by (4) of § 6 (iv), that of A,, and By, 
is By, A, ,. Calling this G,), the inner product of G,, and Cyg is, 
by (8) of § 6 (av), Cyg Gy, = Cy By, A, ,- Similarly that of Aj), 
Bo, Cro Dg is Dep Cyp By yA yr: 

(iii) The transposed of the inner product of any number of double 
sets is the inner product of the transposed sets, in reverse order; 
e.g. the transposed of the inner product of Ag, Byg, Cg,, Dro i8 
the inner product of Dg, Cyg, Boy, Aga. [For we have shown, in 
§ 6 (v), that this is true for the inner product of two sets; and 
thence it follows, by induction, for any number of sets.] 


(iv) The inverse of the inner product of any number of double 
sets is the inner product of the inverse sets, in reverse order; e.g. 
the inverse of the inner product of Byg, Cag, Dag, Lag is the inner 
product of-7%, D°o, C27, Bem a. e; 

If Agg = Bag Cpy Ds E50, then A7* = B% C°Y D8 B82, (VI.10.1) 
[Denote this latter expression (right-hand side) by 7%, and alter 
Byéinittopyp. Then the inner product of A,, and F is 
Ag F™ = Bag B? Cg, C% Dg D’# Es, EM 

= Bag B?Cg., Cc” D3 D’* |; 

= Bug B® Cz, Hid be 

= Bug BP |e 

IG 

B 
Hence, by reciprocation, 
FA = Aw%* ig a Ace 

so that He De wAC | 

(v) It follows from (iv) and (iii), since A°’ is the transposed of 
A, that the reciprocal of the inner product of any number of 
double sets is the inner product of the reciprocal sets; e. g. 


If Ago = Bag CgyD4sE5o, then A? = B*? C87 DY BP, 
(VI. 10. 2) 
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VI. 11. Partial sets.—(i) When we are dealing with a set 
Ay = (Ap Ay Ap oAm) 

we sometimes want to consider the separate or mutual relations 
of groups of the 4’s. The simplest case is when the set divides 
into two groups, one group consisting of k elements, which we 
shall take to be the first k, and the other group consisting of the 
other m—k elements. If we use suffixes ay... in reference to 
the first group, and dx... in reference to the second, reserving 
A pv... for the set as a whole, we may treat the two groups as 
partial single sets of orders & and m—k respectively, and write 


Ag =(A, A...Ay), Ap = (Anti Antor+-Am): 


(ii) In the same way a double set 4, may fall into four groups 
by division by two lines cutting off & columns and & rows 
respectively. We could denote these groups by 

Aay Agy ; 

Aay Agy 
¢ being regarded as coming before y. The groups dq, and Agy 
would be partial double sets of orders k and m —k respectively. The 
groups Agy and 4,4, would each have different numbers of columns 
and of rows, and therefore would not be double sets; but this 
would usually not matter, as we should be specially concerned with 
Aq, and Agy. The important point to notice is that, if we take 
Aq, say, as a partial set and construct the inverse set A7* or the 
reciprocal set A°Y, the set so constructed will not in general be 
the same as the set made up of the corresponding elements of the 
set inverse or reciprocal to 4,5. The inverse set A7%, for instance, 
is given by 

AqgA” =| 2, 

with summations made only from 1 to & instead of from 1 to m. 
To avoid mistake, we may write it (A¥%),. Similarly the set 
inverse to the partial set 4,y may be written [Av] 


m —k 

(iii) If, however, all the elements in the portions Aqy and Ay, 
are 0, so that the set A, practically consists only of the two 
partial sets Aq and Ay, this is also the case for the complete 
reciprocal set AM; all the elements in 4°¥ and AY are 0, and 
the elements of A4°Y and 4? are just the same whether they are 
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regarded as obtained from the complete set 4, p or from the partial 
sets 4g, and Agy. 


(iv) If we had two or more sets, single or double, divided in the 
manner described above, we could take portions from different sets 
to form new sets. If, for instance, we had divided A, into 4, and 
A,,and By into By and By (orders again k and m—k), we could 
construct a new set consisting of 4, and By. 


(v) It would, of course, be incorrect to describe this new set as 
being A,+Bg, or the old set as being A,+Ay: for we cannot 
add together two sets of different order. We could, however, 
look at the matter in another way. Consider the two sets 

(TA ie 0 0s 2500), 

(OPO Oe dine oe 
The sum of these, if we regard each as having the suffix A, is A) ; 
and in this sense, if we denote the two sets by 4, and Ay, and 
regard a and ¢ as connoting X, we could say that 

Ay = Ag+ Ay. 

If we compare A, with a vector, we see that A, and Ay correspond 
to the projections of A, on a ‘ plane’, i.e. a surface of the first 
degree, passing through the first & axes, and on a ‘plane’ through 
the last m—k axes, respectively. 

The extreme form, if we made further divisions, would be that 
in which A, was split up into m component single sets, each 
having m—1 0’s init. It would only be in this sense that we could 
describe the set as being the sum of its m components. 


VII. RELATED SETS OF VARIABLES 


VII. 1. Variable sets.—(i) In the earlier chapters we 
considered the manner in which determinants arose in 
solving a set of equations of the form 


(= 1, 2,...m)d,,4°-+d,,4° 4 0 4+4,4 = Fee 


and in the chapter preceding this we have considered the 
general aspects of the system under which we express these 
equations and their solution in the form 


| NAO ona od (2) 


According to the definitions we gave to the notation, X* 
and JY, are each used in different senses in the two places 
where they occur in (2): X* means ‘the elements of X*’ 
on its first occurrence and ‘each element of X*’ on its 
second occurrence; and similarly for Y,, but in the re- 
verse order. We have, however, by this time practically 
reached the stage of treating a set as a whole, so that 
we can now regard (2) as a pair of statements, one of which 
gives an expression for the set Y, in terms of the set X”, 
while the other expresses X* in terms of Y,.. The form 
of either expression determines the nature of the relation 
between the two sets. 


(ii) The special features of the particular case were that 
the X’s were unknown quantities which we wanted to find, 
that the d@’s were coefficients, more or less accidental, and 
that the Y’s were known quantities arising from the applica- 
tion of these coefficients to the X’s ; and, more important, 
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that this was merely an isolated set of equations, for which 
we had no further use when we had found the X’s. 


(iii) The relations with which we have to deal in this 
and subsequent chapters are of a different nature. We 
have a set @ and a set 25, each consisting of a number 
(the same for both) of elements, which we will call the 
A’sand the B’s. In each set the elements need not be all 
of the same kind. Each of the 4’s is a variable; i.e. 
it either has, or can (as in the theory of statistics or of 
error) be regarded as having, a very large number of 
actual or possible values. These variables, the values 
of which are algebraically independent,* together consti- 
tute the variable set @. In the same way the #’s are 
variables, and constitute another variable set 38. But 
the two sets of variables are not independent of each 
other: they are connected by certain relations, by means 
of which the 4’s are known if the 4’s are known, and con- 
versely. Thus the #’s are functions of the 4’s, in the 
ordinary sense of the word, and the 4’s are functions of the 
B’s. In this case we say that 2B is a function of A, and 
A a function of 26. But we must not only say it, but 
think it; i.e. we must treat the functional relations of 
the 4’s and the J’s rather as interpreting the nature of 
the functionality of @ and 36 than as actually consti- 
tuting this functionality. 


(iv) In the particular case we have been considering, 
A and 3B were the single sets X* and Y,, and the relation 
between them was linear; i.e. the Y’s were linear functions 
of the X’s, and the X’s were therefore linear functions of 

* By algebraical independence of m quantities we mean that 
each may have any of its values, whatever the values of the other 
m—1may be. This does not imply statistical independence, which 
is a different thing. 

2611 K 
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the Y’s. In sucha case we say that VY, is a linear function 
of X*, and it follows that X* is a linear function of Y,. 


(v) In dealing with the theory of the subject, as distinct 
from its applications, we are concerned not with the actual 
values of elements of sets but with the relations between 
the sets. Thus in the case of the linear relation 
Y, =d,,X*, where the variable single set Y, is expressed 
in terms of the variable single set X* and the fixed double 
set @,,, the elements of ¢,, form a kind of framework 


A= diy A +d.) A +d., A +...¢d 4 Ai 
A= dio A + dy. A +59 A +... +4 no A 


DN dim “A 5 dom A ae Asm “A Fee + dan A | 

into which the values of the X’s and the J’s can be fitted ; 
and what we are really investigating are the properties and 
mutual relations of such frameworks. In the present 
chapter we shall consider certain simple relations between 
two such frameworks, namely relations between the linear 
relation of one pair of sets and the linear relation of another 
pair of sets. 


VII. 2. Direct proportion of single sets.—(i) If a 
quantity 7 is a linear function of m X’s, which we will call 
A Ac a, 1b is OF the form 


Z xh, X14) X24 e+ lg A” = hy XD. (1) 


This is the simplest form of statement of a linear relation. 
Suppose, for instance, that Z is the 3rd difference of the 
X’s, formed in the usual way, i.e. 


B= OX, 
This is equivalent to 7= X*—3 A°4+3 X2—1X!, go that 


1 =—1, 4,=3, hs = —3, 4=1, hj=he=... =hy = 0. 
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The /’s having these values, the X’s may alter, but (1) will 
always give the 3rd difference. 


(ii) Now suppose that there is another set 4’, and that 
C is the same function of the 4’s that Z is of the X’s; 
e. g., as in the above example, that it is the 3rd difference. 
Then the /’s are the same, so that 


Cah d. (2) 
We can write (1) in the form 
GX hy, 


on the understanding that a suffix in a denominator is linked 
with a similar suffix on the other side and implies an inner 
multiplication. Similarly we can write (2) as 


Ca — 
Equating the two values of 4,, we have 
Cue, 
ae 2 


as our way of stating that C is the same linear function 
of the 4’s that Z is of the X’s. 


(iii) Next suppose that a set Y,* is a linear function of 
the set X’, so that each of the Y’s is a linear function of 
the X’s, Then we can take Z of (1) to be each of the 
Y’s in turn: but the sets of 4’s will be different, so that the 
relation will be of the form 


bigs = D yy X” 
If there is also a set B,, each element of which is the same 


* The set might be called either Y? or Y pi we choose the latter 
as giving a convenient symbol d,,, for the coefficient of X” (see V. 
10 (i)). 
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linear function of the 4’s that the corresponding element 
of Y, is of X’s, then 
= Ke 
B = d a A’, 


the two sets of d’s being the same. We therefore have * 


as a statement of the fact that the B’s have the same linear 
relations to the A4’s that the Y’s have to the X’s. This would 
be the case, for instance, if the Y’s were the successive 
differences of the X’s, and the A’s were those of the 4’s 
according to the same system. 

In view of the variety of ways in which we are able to 
deal with sets according to algebraical laws, it is perhaps 
permissible to describe this as a case of direct proportion, 
and to say that B, bears the same ratio to A” that Le 
bears to X". This ‘ratio’, here denoted by d,,,, is really 
the operator that is required to convert 4" into B, or X* 
into Le 


(iv) 1! the linear relation of the b’s to the d’s is the 
same as that of the Y’s to the X’s, then that of the A’s 
to the B’s is the same as that of the X’s to the Y’s 
(or, if the ratio of B, to 4“ is the same as that of Y, to 
X", then the ratio of A" to B, is the same as that of 
A" to ¥,); ive. 

a 1G y: a: Ca 


= yp, then p = >> 
AP x! ee 


[Let B,=d,,4". Then d= dB. Similarly X* = d“ Ee 
‘Therefore A*/B, = = AY/Y | 
* This expression B,/A“ must not be confused with B,/A® as 


the double set whose typical element is B,/A% The limitation in 
V. 5 (vi) excludes double sets of this kind ‘from consideration. 


(VII. 2, 1) 
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(v) The above sets of relations can be expressed by the 
diagram in Fig. 1, The crosses 
may be taken as representing 
either coefficients of the X’s 
in the values of the Y’s and 
of the 4’s in the values of the 
B's; or—in virtue of (VII. 
2. 1)—coefficients of the Y’s 
in the values of the X’s and 
of the ’s in the values of 
the 4’s. 


(vi) Ratios of the kind considered above can be com- 
bined according to the laws of ordinary algebra; e.¢. 
SEN Oia Ces oem k OSs 
a pe as (VII. 2. 2) 
The expression on the left-hand side is, of course, an inner 
product. A special ratio is 


A” 


es Vv — 
goal A” 
Example. If B,/AY” = ¥,/X", prove that B,/A*. X*/Y, = [5 


Fre. 1. 


. (VII. 2. 8) 


VII. 3. Reciprocal proportion of single sets.— 
(i) The other important class of cases is that in which the 
linear relation (or ratio) of B, to A” is the reciprocal of that 
of ¥, to A", i.e. in which, B, and A" being altered to 
BP and a 

Tee wee et A 

Dyke: p —~ “up He 

This gives 


OEY oat eg Ae 
so that ‘ 
BP OXF 
Aj Ue j 


We can call this a case of reciprocal proportion. 
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(ii) If the ratio of B? to A, is the reciprocal of that of 
Y, to X*, then the ratio of Y, to X* is the reciprocal 
of that of B’ to 4,3 i.e. 


If Bt = A" then Ay = aD (VII. 3.1) 
A, J, BP XE 


[Let BP= kA. Then 4, = &,,B?. Similarly 
Y,=4,,X". Therefore Ai) Bi Tf Xe 


(iii) The sets of relations 
can be expressed by a diagram 
as in Fig. 2, where the crosses 
represent coefficients of the Y’s 
in the X’s and of the 4’s in 
the B’s, or of the X’s in 
the J’s and of the #’s in 
the A’s. 


(iv) The inner products of the reciprocally corresponding 


sets are equal; i.e. 

p " 

Hes. 

a ee p 

[Let BP = ha yw Then XA*= ht? ig ; and therefore 

Y= hyp", Hence BY, = HP Ah, x” = hk, AX” 
=|5 4,4" = 4, x"] 


» then BPY,=4,X", (WII. 8.2) 


(v) An interesting case is that in which the ¥’s are the successive 
differences of the X’s. It will be found that in this case the B's 
are linear functions of successive sums, and therefore of successive 
moments, of the A’s. In the ordinary system, for instance, which 
is such as to give Y; =X}, ¥,=AX'=X*—X', Y,=AaaNi= 
KP an 2X? 4 XA, WO Deen ok ee Pee YY. ay 
Y¥,+2Y,+/Y;,,...3 and these give Bi=-—34, B'=4 334, 
B* = — 335 4,,..., the constants in the sums being chosen so 
that 2Am4) = 0, 22Ayi) = 0, F2SAy4, = 0,... 
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VII. 4. Cogredience and contragredience.—(i) In 
§§ 2 and 3 we have not assumed the existence of any relation 
between 4* and X* or between B, and Y,. Where a relation 
does exist, the important cases are those of cogredience and con- 
tragredience. We start with a set X*, and a set 4* which 
is derived in a definite way from X*, in other words is 
a function of X*. We then take Y, to be any linear func- 
tion of X*, and B, to be derived from Y, in the same way 
that 4* is derived from X*. Then B, is some function of 
A‘. Of the cases in which this is a linear function, we are 
concerned with two special classes :— 

(1) If B,/4* is always = Y,/X", then A’ and X* are 
said to be cogredient. 
(2) If BL/4* is always = A*/Y,, then A’ and X* 

are said to be contragredient. . 

Examples of cogredience are given in IX. 6 (x), and of 
contragredience in VIII. 3 (iv) and IX. 4 (vy). 

(ii) Instead of saying that 4* and X* are cogredient or 
contragredient, we might say that 4* in the one case varies 
directly as X* and in the other case varies reciprocally as X”. 
When we say that 4* varies directly as X*, we mean that, 
if X* is multiplied by any double set involving A, 4’ is 
multiplied by the same set: when we say that 4* varies 
reciprocally as X*, we mean that, if X* is multiplied by any 
double set involving A, A* is multiplied by the reciprocal of 
this set. 

(iii) If 4° and X* are get t and P* and 4° 

contragredient 
cogredient 


: t, then P* and X* are cogredient ; 
contragredient 


are also } 


if A* and X* are } coapedient k, but P* and A” are 
contragredient 


eer then P* and X* are contragredient. 


cogredient 
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[Suppose, for instance, that 4* and X* are contragredient, 
and P* and A* are also contragredient. Then we have 
relations of the form 
Dyes = hy 2a Q,/P* = AY/B,, 
whence, by (VII. 3. 1), 
Q,/P* = hy ep 


so that P* and X* are cogredient. | 


VII. 5. Contragredience with linear relation.— 
(i) The simplest case of contragredience is that in which 
the contragredient sets are connected by a linear relation. 

(2) Suppose that, with the notation of § 3, the relation 
between 4, and X* is 


Then, if Q@ denotes the inner product (cross-produet) in 
(VIL an 2) 
Q = A, XP = a, MEX” = a, XIX + (a). + Aq) XX? 

+ GggX* X? + (Aig + Gq) X2XP +... + Am AMA™. 
Thus Q is a quadratic in X1, X?, X°,...X™, i.e. in X*. Also, 
since each of the four sets 1", Y,,, 4,,, B* isa linear fanction 
of each of the others, Q can be expressed in a good many 
other ways, e.g. as a quadratic in B*, or in the form A*?4,, Y, 
or al” A, A, 

(2) If the relation between 4, and \* is symmetrical, 
1.8. af yp = Foy, it can be shown that we shall, in addition 
to (VII. 3. 1), have the farther relations 

Aise TA y ae 
Gas aT } = pps 
| a a ee 


(11) Conversely, suppose that we are dealing with a set 
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of quantities or coordinates X“=(X1 2... X™), and that 
we come across an expression 


Q= a Oya X, 
where «,, = a,,, Then we may construct a new set 
given by 
yes Gk 
and we shall have 
Q=A**X,. 


Now suppose we change the system of coordinates lin- 
early or replace the X’s by some linear functions of 
them. The a’s will normally have some definite meaning ; 
and this meaning, though not their actual values, will 
remain unchanged when the X’s are changed. Suppose 
that, when XA” becomes VY, X,—as based on this meaning 
of the a’s—becomes Y". Then we shall have 
bE EO GDS 
and also ; - 


i Ge Came 


Vv 


eS Y= eye 


etc. 


VII. 6. Ratios of sets generally.—(i) The word ratio 
has so far only been used in reference to single sets con- 
nected by a linear relation; if the relation between Y, and A” 
is of the form Y,=d,,, X", we call d,,, the ratio of Y, to X", 
and we call i? the feciprocal of this ratio. We can 
extend the use of the word to sets other than single sets. 

(ii) We have already had examples of a ratio which 
involves a scalar. Thus in § 2 we had relations 7= 4, X’, 
C=h, AX, and we said that 

Cae rae: Z 
ge 


Here we can quite well call 4, the ratio of C to A* or of Z to X’; 


2611 L 
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i.e. it is the ratio of a scalar toa single set. Similarlyin § 5 (11) 
the statement 


eee 
XOX Ye 

may be regarded as a statement that the ratio of Q to X” Y* 

is equal to that of Y,, to X”. In each case the ratio of one 

set to another is the set by which the latter has to be multi- 


plied in order to obtain the former. 


(iii) The more important case is that of the ratio of 
a variable set of any rank to another variable set of the 
same rank. If @ is a variable set of any rank, and 2% 
is a set which is a function of @ and is of the same rank, 
and if the relation between %5 and @ is of the form 


=a, 
where p is a constant set whose symbol contains all the 


suffixes occurring in @ and 3, then we can call p the ratio 
of 2B to A, and denote this ratio by 1B/M. 


(iv) In these cases we can continue to speak of the 
relation of 28 to @ as linear. Also, by solving the equa- 
tions, we find that there is a relation of the form 

A= p's, 
so that, if 15 is a linear function of A, then @ is a linear 
function of 26. Here, p being the ratio of 8 to M, p’ is 


the ratio of @ to 15, and we can eall each ratio the reci- 
procal of the other. 


(v) As an example, suppose that 4 and 36 are of rank 3, 
and that p is the product of three double sets, each of which 
has inner multiplication with @. Then the relation might 
be of the form 

Boo = Py bagt Aye 
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It is easy to show that in this case 

AM = gM eo, EG 
Thus the reciprocal of the product of the three double sets 
is the product of their reciprocals. 


VII. 7. Related sets of higher rank.—With the 
preceding explanation, there is no difficulty in extending 
the ideas of equality of ratios, and of related systems of 
sets, to sets of higher rank. 

Suppose, for instance, that we have a set @ which is 
a function of three single sets, and a set 38 which is a 
function of three other single sets. If the three latter sets 
were functions of the three former, 35 would be a function 
of @. The cases analogous to those considered in § 4 
would be the cases in which there were linear relations 
between corresponding sets. Suppose that 4), 14,2 are 
linear functions of U*, /*, W” respectively, e.g. 

X,=%, Ee Dig l  Le Ges 
that @ is a certain function (in the most general sense) 
of U*, VY, W*, and that 38 is the same function of 
X,, Y,, 4. Then % is some function of @. If we sup- 
pose that, when the values of @,,,4,,,¢,, are made to 
vary, 2B is always a linear function of A, and the ratio of 
3B to A is always compounded of the ratios of X, to U*, of 
Y, to V*, and of Z, to W*, each taken directly or recipro- 
cally, we get an extension of the cases considered in $ 4. 
Thus we might have 
Boo = Tlie Aan, 


so that 
Bio mAs Y, Ww” 
Bye = Ooo = OR GHZ 


each of these latter expressions representing a set of rank 6. 
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In this particular case we can say that A is cogredient 
as regards U* and V* and contragredient as regards W”; or 
we can say that it varies directly as regards U* and V" and 
reciprocally as regards W”, or that it is directly proportional 
to U* and V* and reciprocally proportional to W”. 


VIII. DIFFERENTIAL RELATIONS OF SETS 


VIII. 1. Derivative of a set.—We have now to con- 
sider the cases in which two sets vary together continuously, 
so that there can be a derivative (differential coefficient) of 
one with regard to the other, this latter being a single set. 
The derivative will in all cases be a partial one, since the 
elements of the single set vary independently. 


(1) The simplest case is that of a scalar linear function 
Z=h X= h,X14h, X24... +h, X™. 


Here 
of 
sp = h,. 
Giving p all values 1 to m, we can write this 
af , Z 


ae ae 
and we can regard /, as the derivative of the set 7 (which 
is of rank 0) with regard to the set X*. 


(ii) Similarly, if 


¥,=4,,%", 
then * 
yA" up —~ “YR 


* It should, however, be noticed that in this statement the sign 
‘=’ has not the same meaning in the two places in which it is 
used. When wesay that dY,/0X*=d,,, we mean that dY;,./oxX¢= 
dy for all values of g and r: but, when we say that Y,/X" = dyp, 
we mean that Y, = d,,X* for all values of 7. 
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(iii) A particular case of (ii) is where Y,=X. Here 
0X /oX7 is = 1 or 0 according as 7 is = 4 or #g, since the 
X’s vary independently. Hence (see (VII. 2. 3)) 

op a Ga aa pi ee 
Sigma tat eo teey G: 

(iv) Taking also sets of higher rank, and not limiting 
ourselves to linear functions, we see that the derivative of 
a set with regard to a single set X* is a set of rank higher 
by 1 than that of the original set. 


(VIII. 1.1) 


VIII. 2. Derivative of sum or product.—(i) The 
derivatives of sums and products of sets follow the ordinary 
laws of derivatives of sums and products; e.g. 

B+C) 8B , 
nN = A = Saye 
oA oA od 
2 oC 
pl) ‘ 
24s = d.d* i 


(VIII. 2.1) 


(VIII. 2. 2) 


(ii) Asa particular case of this last result, take the scalar 
quadratic form 
A ay ie a 
considered in VII. 5. Here, taking (VIII. 1. 1) into 
account, we have 


3Q OX” 
ee ry He = 

>ye = By X +4,,X ES 

This can be verified by expressing Q in terms of the 1’s and 
finding the partial derivative with regard to X% in the 
usual way. 


v Ty ‘au 
Ay X FAX” | = 2A X”. 


(iii) For an application of this, suppose that 
By AMX” = 5, XUX” 


for all values of the X’s, By taking adjoining values of 
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X*, we get the result which is expressed by differentiation, 
] (Ya 

namely 2a,,4" = 20,,X”. 

Differentiating again, or equating coefficients, we find that 


Cy = bw 


VIII. 3. Derivative of function of a set.—(i) If z 
is a function of y, and y is a fanction of x, then we know 
that 

dedy dz. 

rape mare 
and, total and partial differential coefficients being in this 
case identical, 


(ii) Now suppose that B, is a function of A*, and C? is 
a function of B,. Then, p and r being values of A and p 
respectively, we know that 
oe 2CUeB,  8Cr OB, oF OB, Ch dB, 
34? 9B, dA” 3B,dA? "OB, 0A? dB, 0A?’ 
Hence, giving p and ¢ all their values, 
20? — dCP IB, 


— = __._ VIII, 3.1 
SA er OA ( 

The argument applies to sets of higher rank. If, eg., 
we are dealing with C?’:*:, where the oc... relates to 
aspects independent of &,, then 


Mle PN NB 


—_—_ = —____ —_“. VIII. 3.2 
0 A* 0B, oA* 
(iii) As a particular case (see (VIII. 1. 1) ), 
P > Be ip 
eee ip (VIII. 3. 3) 


> BY dc? 30% 
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(iv) Suppose that the relation between 2, and A i6 
linear, say 
De =f 4, Ae sean) 
Then, replacing C” or C’ by €, we have 
XC _ rd Cy 
0 VAN OB, CoA: a 


Hence 


Thus 4* and J€/24* are contragredient. We can express 
this by saying that 4* and the operator of 24‘ are contra- 
gredient. 
(v) The determinant of 2B,/ dA” is the Jacobian of B, 
with regard to 4*; ie. 
o(B,, By, +++ Brn) a OB, 
0(A!, A*,.., A®) ~ 1349 
From (VIII. 3.1), taken with (V. 10. 5), we have the 
ordinary formula for the product of two Jacobians : 
dB, ocr | dB, Cr dcr 
9d7|" | OB, dA 0B, 242 


> (OE 


writ (VIII. 3. 5) 


VIII. 4. Transformation of quadratic form to sum 
of squares.—(i) For an example of a Jacobian, take the 
case in which a quadratic form is to be expressed as the sum 
of the squares of linear functions of the variables. Let the 
quadratic form be 

Q=a,,X"X?, (1) 
where 
Let pare 

i = ng (2) 
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be a set of linear functions of the X’s, so that 

wee ey. (3) 
Suppose that the 2’s are chosen so that (1) gives 

Oar tt. + Ly ln = FY, 2,5 (4) 
Then, by substitution from (2), 
Q = bX” bg XP = Dy gb yg XO KP, 

Hence, by comparison with (1) (see § 2 (iii)), 

Oy = Oy ql yg (5) 
The Jacobian which we should usually want to find is 
that of X* with regard to Y,, i.e. 


7m a aE A) i 


Sa Ope eyed doy oa ge 
By (3), 
ON eae 
Te = oe 
and therefore 
J= | ba" |. (6) 


But (5) gives 
ah? — BRO gPe 


and therefore 
Nae me 2 2 | 08" | OFS ice del, 

Hence, combining this with (6), 

T= |b8| = {at |}4=1/ {|agr |}? (7) 
Similarly the Jacobian of Y, with regard to X” is 
a OR eee a 1 a 
= 301, 02,... Xm) 7 = Varl = 11 ar 1} 

=1/{lar"|}2. (8) 

There are a good many different ways of expressing Q as 


in (4), but they all give the same two Jacobians. 


2611 M 


Ah 
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(ii) Hence we easily obtain the value of the multiple integral 


| | | ot ° ag Xd X?...d Xm, (9) 


™m ic ies) 
Using (|) to denote | | ...(m times), we have 


-oO 


mm 
De (|) o-#9 Ja, dY>...dXn 


ioe) 
=a) eWay... et¥m¥m Gy, 


= {| am |}4(2m)8™ (10) 
(iii) We shall require, in the next chapter, the value of 
N/D, 
where 
Ne (|) o¢ ee Waxtax®...axm (11) 


and Dis as above. We find the value of N, as we have found that 
of D, by expressing everything in terms of Y’s. By (2) and (4), 
and (VIII. 3. 1) and § 2 (ii), 


and therefore, by (3), 


D) 
Pe aA = 6b, Y5¥7, 


m 
N =", ( | ) Y,Y,e 7% ?dXdX?,..aXm 


™m 
= Jd"b,, (|) Yate. “wy.gl, ee (12) 
Let us write 
M =s0e() Yodse aviay. cere: (13) 


so that 
N = 6,,M,. (14) 
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Then M; consists of m terms due to the m values of o. Since Q is 
the sum of the squares of the Y’s, the only term which counts in 
the integration is that for which o =t. Also we know that 


lee) oO 
| rene MMay = | ehh ay, 


-0 -* 


Hence it follows that 
m 
M, = Joe (|) eo} dy, dY,...dYm 


=btD; 
and therefore, by (14), 
N/D = by, b97 
=|{. (15) 


IX. EXAMPLES FROM THE THEORY 
OF STATISTICS * 


IX. 1. Preliminary.—(i) The special feature of a statis- 

tical set 
Xi Ss(AG AGA, eA). 

of the kind which we have to consider in this chapter, is 
that each X has one only of a very large number of actual 
or possible values, which together constitute the field from 
which the X is drawn; and the fundamental facts with 
which we are concerned are the relative frequencies of 
occurrence of the various possible combinations formed by 
taking an X from each of the m fields. Thus the X’s are 
variables, and the expression for the relative frequency of 
joint occurrence of a particular set (X, X,...X,,) involves 
the X’s of the set, with certain constants. In a large class 
of cases the constants depend on the mean values of the X’s 
and the mean squares and products of their deviations 
from their respective means. It is to these cases that the 
new notation is specially applicable. It may be that some 
of the X’s are drawn from the same field; we shall proceed 
as if the fields were all different, but this does not affect 
the validity of the reasoning. 


(ii) We shall only consider two kinds of cases. 

(2) The first kind of case is where our statistical 
information relates to a large number of individuals, and 
X, X,4,... are the measures of specified attributes, such 
as height, head-length, chest-expansion, intelligence, etc., 

* Thave dealt with the problems of this chapter in as general 
terms as possible. The explanations in small print may help to 


show the statistical student the way in which the problems 
actually arise. 
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of any individual. Here the ‘variability’ of any X has 
reference to the different values that it takes for different 
individuals. The frequency of joint occurrence of particu- 
lar values of X, 1, X,... may be a complicated function of 
these values and of certain constants which are to be 
determined. 

(4) The other kind of case is that in which the ques- 
tion is one of ‘ graduation’ or ‘fitting’. Here X, X, X,... 
are observed values of different quantities (e.g. rates of 
mortality at successive ages), or are the results of observa- 
tion of one quantity at different times or by different 
observers. The ‘variability’ of any particular X lies in the 
fact that the observed value contains an unknown error ; 
and our treatment is based on the assumption that a rela- 
tion of a particular kind holds between the true X’s. 

In cases of this latter kind it should be noticed that 
the only things which we treat as variables are the errors 
in the X’s. We may take, as the typical case, the observed 
rates of mortality X, X,X,... at ages 47,¢,... If we 
denote the true rates by €,6&..., then the assumption 
which we make is really an assumption that & is a certain 
function, with constants to be determined, of ¢. So far as 
this function is concerned, € and ¢ might be called variables. 
But, for our purpose, they are not variables. We are con- 
cerned with the fixed values ¢, /,¢,... and the corresponding 
fixed, though unknown, values &, , ,...; the real variables 
are the differences between the observed values X, X, X;... 
and the true values & & &,.... 


(iii) There are two reasons why the same mathematical methods 
apply to subjects so different as relativity and statistical theory. 
One is that the number of X’s in astatistical set may be very large : 
in the second kind of case mentioned in (ii) it may be as many as 
20 or 30. The need of a condensed notation is therefore even 
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greater than for relativity, where the number of dimensions does 
not exceed 4. The other reason is that, as in the relativity theory, 
we are, to a certain extent in the first kind of case, and very 
largely in the second kind of case, concerned with sets constructed 
from the original sets by means of linear relations. 


(iv) We denote the mean product of the deviations of 
X, and X, by (X,.4,), or, more briefly, by 7g, i.e. 
Sor = (Xq-X,) = mean value of 
(X,—mean X,) (X,—mean 4,). 
It must be clearly understood that, though (4,. X,) 
depends on g and 7, it relates to the complete fields 
from which X, and X, are drawn, and, for any par- 
ticular values of g and 7, is not a variable, like 2 and X,, 
but a constant. 


(v) Although we have defined (X,.X,) as a mean value, 
our dealings with it ultimately depend on the algebraical 
laws which it follows. These are, first that it satisfies the 
ordinary laws of multiplication of two expressions X, and 
X,, le. that, ¢ being a constant as regards the 1’s, 

(X, v x,) = (x, 2 X,)s (X, : (x, = X,)) 9 (A, : X;) 53 (X, : X,), 
(X,.cX,) = ¢(X,.X,);  (TX.1.1) 
and next that, if wis any linear function of the \’s, then 
(w.w) is positive unless u = 0, i.e. that 
(u.u)>O0 if wu 0. (IX. 1. 2) 
It is clear that (w.«) = 0 if w= 0; for (IX. 1. 1) gives, by 
putting c= 0, 
(X,.0) = 0, (IX. 1. 8) 
whence (~.0)=0 follows by means of (IX. 1. 1) (ef. (vi) 
below). These are the only properties we shall use; and 
our results will therefore be true for any meaning of 
(X,-1,) that: satisfies these laws, provided, of course, that 
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(X,.X;) is a constant as regards all the X’s. And, con- 
versely, we shall only be dealing with sets for which 
(A,-4;) has a meaning and a value for each value of g with 
each value of 7, and satisfies these laws. It will be assumed 
that the values of (4,-4;) are known; or, at any rate, that 
our results are final when expressed in terms of these 
values. Whether we are dealing with mean squares or 
mean products or not, we can call (X,.X,) the (.) of 4, 
and X,. 

In the first kind of case mentioned in (ii) (X, . X,) would usually 
be the mean square of deviation of X, from its mean, i.e. would 
be the square of the standard deviation; and (X,.X,) would be 
the mean product of deviations of X, and X,. from their respective 
means, and would therefore, in the case of normal correlation, be 
equal to the product of the standard deviations of Xy and X,. 
multiplied by their coefficient of correlation. In the second kind 
of case (X,.X,) is the mean square of error of X,, and (X,.X,.) is 
the mean product of errors of X, and X,. 


(vi) It follows from (IX. 1, 1) that 
(a,X,.X,) =4,(X,.X,), (a,%,.9,X,) = 4,0,(X,,- X,). 
(1x14) 
(vii) If X, is a set of the kind considered in this section, 


then so also is any other set which is a linear function 
of X,. Suppose, for instance, that 


YR =P X,. 
Then, by (IX. 1. 4), 
OP IS rN) = T(x, Xs) 
which has a definite meaning for each value of g with each 
value of 7, and can be shown to satisfy the laws stated in (v). 


(viii) Our results are also subject to the condition that 
none of the determinants of the double sets we have to 
deal with are 0; 1. that (Aq . X,,)| # 0, whether the range 
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of values of g and 7 is the whole of the range 1 to m or 
a part of it only. 


IX. 2. Mean-product set.—(i) The quantities (X, . X,) 
constitute a symmetrical double set 
t, p = Sou = ((X,.%,) (X,.%_) (Xy- 42). A Xin) 


B 7 
(X,. 1.) (X,. Xs) (X, . Xs) te (XQ. 4 =) : (IX. 2. A) 


(X,. Aan) (X, ; AG) (X; I) sere ee : Da) 
We call this the mean-product set. 


(ii) Corresponding to this there is a reciprocal set 
fr? = fPY wiven by 
e A x» 
LS = Lon 2 dp HT I =p 


(iii) If (X,.X,)=0, we can for the purpose of this 
chapter describe X, and X, as statistically independent. 
Strictly speaking, this is a loose description, since the 
complete statistical independence of two variables X,, and 
X, would imply a good deal more than that the mean 
product of their deviations from their respective means 
should be 0. But we are only concerned, here, with mean 
squares and mean products. 


(iv) The simplest class of cases—from the point of view 
of algebraical treatment—consists of those cases in which 
the \’s are statistically independent of one another and the 
mean square of deviation of each Xis 1. We can express 
this by duplicating the set, thus: 

Ne che Mn ee 

Deo fg. een 
and saying that the (.) of corresponding elements of the 
two sets is 1, and that the (.) of elements which do not 
correspond is 0, 


nme 


mr 
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For a set of this kind we have 


Sor (X, A,) = Ez 
so that the mean-product set is the unit set. It follows 
that in this class of cases the mean-product set and its 
reciprocal set are identical. 

(v) The next kind of case, in point of simplicity, is that in 
which the X’s are statistically independent of one another but the 
mean squares of deviation are not all 1. This would be the case, 
for instance, if the X’s were independent observations, not all of 
the same weight, of a single quantity. For practical purposes 
a case of this kind can be brought under (iv) by expressing each 
X in terms of its standard deviation (square root of mean square of 
deviation) as the unit. 


(vi) There is also an important class of cases in which the X’s 
fall into two groups, such that each X in one group is statistically 
independent of each X in the other group. If, as in VI. 11 (i), we 
denote the two groups by 4, and A4, then the property is that 


(4g. Ag) = 0. 


IX. 3. Conjugate sets.—(i) When a set X, is not of 
the simple kind described in § 2 (iv), we shall find it useful 
to introduce another set X* which (1) is a linear function of 
X, and (2) is such that, if we place the sets opposite one 
another, thus: 

Ay xX, X; 28 a n 
Re A Ae 


the (.) of corresponding elements of the two sets is 1, and 
that of elements which do not correspond is 0. This new 
set X* is said to be, conjugate to X,. 
(ii) The second of the above conditions can be written 
(ane y= |%, (IX. 3,1) 


2611 N 


98 Conjugate sets IX. 3 (ii) 


or 
(ea (IX. 3. 2) 


(iii) Each element X? of the new set will contain m 
terms, with m coefficients which have to be determined 
from the m equations given by 


(XP. X,) =f. 
There are altogether m? equations to determine \*. By 
regrouping these according to the values of » in (X*. X ne 
we see that if X* is conjugate to X, then X, is conjugate to 
X*. This is in fact evident from the symmetry of (IX. 3. 2). 


(iv) To express X* in terms of X,, or X, in terms of 1%, 
let us first take W to be any linear function of X,, say 
W = ahX,. (1) 
Then we want to find an expression for a". 
As we know the value of (X*. Y,), we take the ( . ) of 
W and X*, By (IX. 1. 4) and (IX, 3. 2) we find that 
CHexe) = (aX, aor) = al (X,,. X*) === a" | =a’, 
whence 
aa (0 37); 
Substituting in (1), 


W=(W.X)X,. (IX. 3. 3) 
Taking W to be each element of X* in turn, we have 
Dae Oe X") Xie. (IX, 3. 4) 


We do not yet know the values of (X*. 1“). But, if we had 
started with WV as a linear function of X*, we should simi- 
larly have got 

Wa (WX) x, (IX. 3. 5) 
whence 

X, = (X,.X,) X" (IX. 3. @) 


% 
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Writing this in the form 


X= Ayr" (IX. 3. 7) 
we have, by reciprocation, 
Xt = f'"X,, (IX. 3. 8) 


which gives A” in terms of X,. Further, comparing 
(IX. 3. 8) with (IX. 3. 4), we see that 

1) en af (TX. 3, 9) 
and, of course, the converse also holds. This result is 
dependent on the assumption, made in § 1 (viii), that 
| Sor | is not 0. 


(v) We could have obtained (IX. 3. 4) and (IX. 3. 6) in 
fewer steps by considering the set as a whole instead of 
element by element. If we assume 


X=a"X,, 
then we get i 
(X* xX) = (a ¥ Ae AV =a”, 
so that 
a = (X*, X#), 
This gives (IX. 3. 4); and (IX. 3. 6) can be obtained in 
the same way. 


(vi) We can write (IX. 3. 3) in the form 


W/X,, = (W.X*); (IX. 3.10) 
and similarly from (IX. 3. 5) 
Vi (WX). (IX. 3. 11) 


(vii) If the set A, is of the special kind considered in 
§ 2 (iv), ice. is such that 
(4,4) =I7) 
then : : 
X, = (4,.4,) A” = pee = x’, 
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so that the set is identical with its conjugate. The set is 
said to be self-conjugate. 


(viii) In a case of the kind mentioned in § 2 (v), where Snqg= 0 
if g#p, but fy is not necessarily =1, it may be shown that 
SPP =1/fyp, and fri = 0 ify Fp, so that XP = Xp /foy. 


(ix) Next consider a case of the kind mentioned in § 2 (vi), 
where X, consists of two portions, the elements in each portion 
being independent of those in the other portion. As before, we 
take one portion to consist of the first & elements, and the other of 
the remaining m—k, and we denote the two portions by X, and 
Xy. Then the special property is that 


Fag = (Xa: Xy) = 0, (1) 
whence, as in VI. 11 (iii), it follows that 
f°? = 0. (2) 


Breaking up the right-hand side of (IX. 3. 7) into two portions, we 
get, according as A belongs to the first or to the second portion, 
the two separate results 


Xa =SayX7, Xy = FyyX?. (3) 
Similarly, from (IX. 3. 8), 
Xta fl, XP = fev Xy- (4) 


In finding f*7 and f?¥ from J rw: it is (see VI. 11 (iii)) immaterial 
whether we take the set 7), as a whole or the sets Jay and Foy 
separately, so that (4) may equally well be written (see VI. 11 (i1)) 


Xt = (f°), X,, XP = [FP m4 Xy. (5) 


IX. 4. Conjugate sets with linear relations.— 
(i) Let ¥, be any set which is a linear function of X,, and 
therefore of A"; and let Y? be its conjugate set. Then, 
taking W in (IX. 3. 10) to be each element of ¥, in turn, 
we have 

a Y,/%X, = (¥,.X"). (Tx 
Similarly 
Y,/X# = (¥,.X,), YYX,=(¥.X"), Y/H = (¥P.X), 

(IX. 4. 2) 
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(i) By combining ratios, we get such results as 


(2. 2") = B,/X,, = #,/¥,. Y,/%,, = (E,.¥)(¥,.X"). 


(IX, 4. 3) 
(i11) To find Y?, suppose that 
P= Py Xp (1) 
Let the conjugate set be 
ee eee ae 
Then ms 
fe = Ie.. 5) — Care Ge boy x,) = Leola = kouPou 3 
whence, by reciprocation, 
hay = BRIE = OPM, 
Thus the conjugate set is 
jhe ts Os. (2) 


(iv) Similarly, if 
fen Y. = bey “yp dng Xo, 
Yé = [AGI OE. 
etc.; in other words, the conjugate of an inner product is the 
! ees ae of the factors. [Let us write 
| conjugate 


CEH 167 619 GK. 


inner product of the 


Then 
(Cf. ¥,) = BID c7 ed ™ dog (X*. Xa) 
€ 


since (X*.X,)=]|4. Hence ae 
CS YS; 

We might, alternatively, have deduced this from (iii) by means of 

(VI. 10. 2).] 


(v) From (IX. 4. 1) 
DO Sm (lee) SAY iA 
and therefore, by VII. 3 (iv), 
= A (IX. 4. 4) 


Thus conjugate sets are contragredient (VII. 4); and the 
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inner product of a set and its conjugate is the same for 
all linearly related sets. If we denote this inner product 
by Q, then the sets X,, 4", Y,, Y* are connected by four 
relations of the form 

Ay _ YF; Q 


p=yp=.-¥) =x pe (IX. 4. 5) 


We can express Q in such forms as 
ea CP Cale A Che) Gy 9.0) ar (PX) Aus 
the last of which, when written out in full, is 
SMX, + 2 FP KM, +f XoXo + APP AAS + oe tf Nig Ans 
or in more general forms such as 
Q=(%,. D)GD,, 
where C, and JD, are any linear functions of X,. It must 
be remembered that the invariability of Q only applies for 
the particular values 4,, X,,...X,,. If there were a dif- 
ferent set of \’s there would (in general) be a different Q. 


IX. 5. The frequency-quadratic.—(i) In most of the 
cases We are considering, whether of the first or of the second 
kind mentioned in $ 1 (ii), the frequency of joint occur- 
rence of values lying within limits 

X, + $a4,, X,+3d4X,,...X4m +304, 
is proportional to 
eax dX. aa 

where, if 2,2 ,... 2%, are the deviations of X,,X,... 2, 

from their respective means, P is of the form 
PZaa,0, + 2a%a, 2, +a%a,.0,+2a3aa,+...+0"™2, 2%. (1) 

In our notation this becomes 

P=a™n, a, ; (2) 
where 
a! = att*, (3) 
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(ii) Let us write 
EX = aa = a a, ta at... 0 tm 
— 49P/ O2,. (4) 
Then it can be shown (see VIII. 4 (iii)) that 


ats 
Dp ae a 
mean value of LPx, e ees | i (5) 


Taking (B, : C,) to mean, for these cases, the mean product 
of B, and C,, it follows from (5) that E* is conjugate to 
By, 1. e: 

BX = a, (6) 
Henee, by (4), 


and therefore, by (IX. 3. 10), or by comparison with 
(IX. 3. 4), 
a => e/a, => (a . a), (7) 


(ii) Thus the a’s in the expression for P given in (1) 
are the mean squares and mean products of the elements of 
the conjugate set. Similarly, if we expressed P in terms 
of the conjugate set, the coefficients would be the mean 
squares and mean products of the elements of the original 
set; 1.€. 


pe 


a NY = Ons poe agit 

an 0 = 0, = My, 0% | 
és il pal ee 2 2 Mes 
=A, UU $2 G2" 4 Ay EL" +20. UF oF Ag OU 


where 
ay ae (2) .@,). (9) 


(iv) Take, for example, the case of two quantities X,, X,, whose 
standard deviations and coefficient of correlation are c,, ¢,, and ~. 
Then it is well known that 


gh @\k, , Laity 
p= (2% aa Vie 
nip Sear ees 
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This gives, for the members of the conjugate set, 
= Ly 1h," 1 4 1 (-M 4%) x A ° 
* G Ce ) > ¢,(1—rr) eae €¢, t/ (l=) 


It is easily verified that the mean square of x! is equal to the 
coefficient of x,x, in P, and so on. 


(v) If we express the z’s linearly in terms of a new set 
Y, the value of P will remain unaltered. We can put this 
differently as follows. Suppose that y, is any linear func- 
tion of a. Let 4,,=(%,.y,)3 and let 2“ be the reciprocal 
of f,,. Then, if we write 
y = jh? y p? 
we shall have 

PI, = P= any. 

(vi) The (w,.@,) or (,.,) which we have so far been 
considering is the mean square of #,, or the mean product 
of x, and #,, without regard to the values that each of the 
other zs may have; i.e. the mean square or mean product 
taken for all possible values of these other 2’s according to 
their relative frequencies. We may also want to know 
what happens when some of the 2’s have definite values 
ascribed to them and are not allowed to vary from these 
values. In these cases we follow the principle of VI. 11 (ii). 
Suppose that all the a’s after x, are fixed. Let the a’s up 
to x, be denoted by a, or xg ete. Then our methods apply 
to the set of order / formed by these a's. The principle, 
therefore, is as follows. Suppose we want to study the 
variation of the 4 quantities #, when the m—4 quantities 
a, are fixed. We first construct the mean-product set /,,, 
(=a,,) for the m quantities v, and x, ; then construct the 
reciprocal set f*? (=a"?), the elements of which are the 
coefficients in the terms in P ; then take out the partial set 
J*” corresponding to @,; and then find the set ( Sk Which 
is the reciprocal of #°%. The result is the mean-product 
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set of x, when 2, is fixed. (This is a well-known theorem, 
but is usually expressed in terms of determinants.) 


(vii) If the partial sets 7, and @, in (vi) are independent, 
so that (v7, +%4)=0, the elements of the mean-product set 
outside the portions corresponding to (x, .#,) and (”,.2,) 
will all be 0, and (see § 3 (ix)) the values of (f,,);, will be 
the same whether we construct them from the whole set 
or from the partial set. This is otherwise obvious ; for, if 
the a’s of w, vary independently of those of w,, they vary 
in the same way whether the latter are fixed or vary. 


IX. 6. Criteria for improved values.—(i) Our next 
problem, considered in this and the following section, is that 
of reduction of error, in a case of the second kind mentioned 
in § 1 (ii). We have a set of quantities 

De (DT Dee D) 
which contain errors ; the mean products of error being 
dy, = (D,.D,). (IX. 6. A) 
The whole set J, consists of two portions 
D,= (PD, Dy. Dy), Dy =(Putr Dysa»Dn)- 
All the D’s are the results (direct or indirect) of observa- 
tion ; but the true values of the DJ, are negligible (within 
the degree of accuracy to which we are working), and, if U 
is any one of the D’s, or any linear function of them, we 
can add to it any linear function* of D,, without altering, 
except to a negligible extent, the true value which it repre- 
sents. Ifthe sum of U and an indeterminate linear func- 
tion of D,, is represented by U’, the problem of reduction of 
error is to determine this linear function so as to make 


* This is, of course, an incomplete statement. We could replace 
U by any function of Uand Dy which would be equal to U if the 
Dy were all 0. But we are only considering linear functions. 
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(U’.U’)a minimum. The resulting value of U’ is called 
the improved value of U, and will be denoted by JU. The 
elements of D, are called the auxiliaries. We can replace 
a by 8, y,.--+, and ¢ by x, W,..., as occasion requires. 


(ii) The way in which this problem arises is as follows. We 
start with a set of observed quantities X,, X,,...X,,, which 
correspond to a series of values of some other quantity ¢ at equal 
or unequal intervals ; the X’s might, e.g., be rates of mortality at 
different ages. The X’s contain errors; and our fundamental 
assumption, based on general experience and on inspection of the 
particular data, is that the true values are so nearly of the form 
(in ordinary notation) ¢)+¢,¢+¢,0+...+¢,_,¢" that their 
differences (divided differences if the values of ¢ are at unequal 
intervals) after the (k—1)th are negligible. We may therefore add 
to each X any linear function of these differences, which are what 
we are calling Dy.,, Dy4o,...D,,. The problem is to determine 
the coefficients in this linear function so that the mean square of 
error of the sum of the X and the linear function shall be 
a minimum. 


(iii) We have first to see what relations hold between 
the two portions of J, and the two portions of its con- 
jugate set when similarly divided. Denoting the con- 
jugate set, as usual, by D*, let the two portions be 

i (Di D* . D, i (Dé+1 DE De. 
Here it is to be observed that D* is not (in general) 
the set (order 4) conjugate to the set D, (order 4), 
since each element of it is a linear function of the 
whole D,; and similarly for D?. Now the condition of 
conjugacy is 

(DP, D,) = la 
But, if D’ and DY, belong to non-corresponding portions 
of the two sets, g cannot be equal to py. Hence we get 
the relations 
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(D* . Dg) = |§...(1), (D* . Dy) = 0...(2), 
(DPLD,) =0.5.(8), (D*.D,) =| 0.5.48) 


(iv) First let U be an element of D, or a linear func- 
tion of D,, say a?D,. Then its improved value must be 
VV) ie, 

I(a?D,) = 0. (IX. 6. 1) 
For this makes (U’. U’) = (0.0) = 0, by (IX. 1. 3); and, by 
(IX. 1, 2), (U’.U’) would be > 0 if U’ were not=0. Hence 
(U’. U’) is a minimum when U’ = 0. 


(v) The next most simple case is that in which U is an 
element of D* or a linear function of D*, say 


U=a,D". (1) 
Let the value of U’ be 
U’ = U+u, 
where 
“= aD,.. (2) 


Then, by (IX. 1. 1), 
(U’. U’) = (U.U)4+2(U.u)+(u.u). 
But, by (1) and (2), and by (2) of (iii) above, 
(7) = a,a° (D*. D.) = 0. 
Hence (U’. U’) is a minimum when (w.~) is a minimum ; 
and this, by (IX. 1. 2), is when w = 0, so that 
UGeaUs 
Hence the improved value is the same as the original value ; 
i. €. 
ES IAG eed De (IX. 6, 2) 

(vi) The simplicity of the results obtained in (iv) and (v) 
suggests that we should in all cases regard U as expressed 
in terms of D* and D,. There is no difficulty about this, 
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in theory, whatever linear function of the D’s U may be. 
If, for instance, U is given as a linear function of D,, then 
we obtain our result by eliminating D, (4 values) between 
this formula for U and the & equations which give D* in 
terms of D,, i.e. in terms of D, and D,. Suppose then 
that 
U=V+P, 

where 

Vi WW = OD 
Then U’ is formed from U by adding some linear fune- 
tion of Dy, so that 


Ula PEW 
where V =, D® as before, and W’ is of the form 
WV" = end 
Hence 
(OU) = (VP) +2 ory ah a 
But 


CH ) =O c? Ds) —O en Cie D,) = 9, 
by (2) of (iii). Hence 
(FU SF PCO 

But this is a minimum when //’ = 0. Hence 

I(6,D* + 5° D,) = 6,D*. (IX. 6. 3) 
In other words, if we express UV in terms of D* and D3, 
the improved value of U is found by omitting the part 
involving D, 


(vii) Since, by (IX. 6. 3), ZU is a linear function of D%, 
and, by (2) of (iii), (D*. D,) = 0, it follows, by (IX. 1. 4), 
that 

(1U.D,) = 0. (IX. 6. 4) 
In other words, the (.) of any improved value and each 
of the auxiliaries is 0. 
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(viii) It also follows from (IX. 6. 3) that if two quantities 
differ by a linear function of the auxiliaries they have the 
same improved value. 


(ix) By taking U in (vi) to be each member, in turn, 


of a set 
By = b,,D° +09 D,, 


we find that 
IB, =1,. D*. 
Also s = 
TER) = I (6, Di +h 0? Dy) = eRe = Dee 


(IX. 6. 5) 
i.e. the improved value of any linear function of the B’s is 
the same linear function of their improved values. 
(x) Altering & in (IX. 6. 5) to ls, and writing C,= 
ee B,, we find that 


IC C 
JB a le IX. 6.6 
Gah tes, ( 


i.e. the improved values of two linearly connected sets are 
related in the same way as the original sets; or, more 
briefly, a set and its improved values are cogredient. 


(xi) Since we know that the improved values of D 
are 0, we have really only to determine those of & other 
quantities. In view of (IX. 6. 5), we can choose these to 
be any linear functions of D, that we like, with or 
without linear functions of D, added; and similarly we 
can replace D, by any linear functions of D,: provided, 
in both cases, that none of the mean-product determinants 
are 0. The functions so chosen can be called J, and 
D,, so that we need only consider the problem of find- 
ing JD... 

(xii) The result stated in (IX. 6. 2) gives us the extension, to the 
general case in which the errors of the original observations may 
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have any mean squares and mean products, of the ‘method of 
moments’ ordinarily applied to the case of a self-conjugate set 
(§ 8 (vii)). We take X,, as in (ii), to be the original observations, 
and D, to be their differences of successive orders. Then we have 
found that the improved value of any linear function of D* is the 
same as the original value. But, by VII. 3 (v), D', D’,... D® are 
successive sums of the elements of X*, the set conjugate to X, ; 
and the first & moments of X* are linear functions of these sums. 
Hence the improved values of these moments are the same as their 
original values; and this, by (ix), is the same thing as saying that 
the moments of the improved values of X* are equal to the 
moments of the original values. Thus the method of moments 
still applies. But it should be observed that it does not apply to 
the original set of observations, but to the conjugate set. 

As a simple example, suppose that X, is a set of independent 
observations of a single quantity, the mean square of error of X, 
being ¢pc». Then (§ 3 (viii)) the conjugate set is 

( xX, xX; Xm . 


ee eeee 


C1Cy Cylo Cem 


As the X’s will all have the same improved value, which we will 
call JX, there is only one moment to be considered, namely, the 
Oth moment, or sum, of the conjugate set. Hence, equating the 
sums of original and of improved values, 

bees! Be PME Oo Tx 
en ee eo = = +— 


Gy Cy Cy mem 4% Cole nem 


’ 


which gives the familiar result. 


IX. 7. Determination of improved values.— 
(i) From the results obtained in the preceding section we 
deduce three methods of finding the improved value of any 
element of D_, say Dy. 

(1) We can express D; in terms of D* and D,, and then 
omit the part involving D,. The result is 1D;. 

(2) We can say that J is some linear function of D*. 
This linear function has & coefficients to be determined ; 
they are determined by the condition that, if the linear 
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function is expressed in terms of D,, the coefficient of 
D, is 1 and those of other elements of D, are 0. The 
practical application of this method depends on the cir- 
cumstances of the particular class of cases. 

(3) We can say that /D; is obtained from D, by adding 
a linear function of Dy; which we have called — 4? Dy. 
This linear function has m—Z coefficients to be determined. 
We have found in (IX. 6. 4) that (LD;.D,)=0; this gives 
m—k equations, from which the coefficients in question can 
be determined. Thus ¢he necessary and sufficient conditions 
jor ID, are that it dufers from Dy by a linear function of 
D,, ant that (ID. Dy) = 9. 

These three methods are exhibited in (ii), (iv), and (v) 
below, and a fourth method is given in (vii). 


(ii) To apply the first method, let us write 
Dees ep)? +e D,,. 
We do not need ¢, since the only part of D, that counts 
for the improved value is ¢,,D°; we therefore get rid of 
e*? at once, by means of something whose (.D,) is 0. 
This, by (2) of § 6 (iii), is DY’. Taking the (DY .) of both 
sides, we have 
CDE NN a AOI ID NES. fh 


Also, by (1) of $6 (iii), 
(DY. D,) = |i. 
Hence 
a? Teg = LY: 

Here a, 8, y relate to the partial set (4, 4, ... A,), 
and the statement is limited to this set. Dealing only 
with this set, let us denote the reciprocal of d®’ by 
(4ey)z3 this, as pointed out in VI. 11 (ii) (cf. $5 (vi) of the 
present chapter) is not ordinarily the same thing as d,, as 
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obtained from the whole set of order m. We have then, 
by reciprocation, 

Cap = (dp.)xla = (gq) Ke 
Substituting in the expression for D,, and dropping the 
en? D,, in order to get the improved value, we have 


ID, = (dgq),D”. (X72) 


(iii) Although, in the above, we have not needed ¢**, we 
ought to find its value in order to satisfy ourselves that, 
as has been stated in §6 (vi), any linear function of D), 
say J,D,+ 94D, can be expressed as a linear function of 
D* and D,; to do this, it is only necessary to prove the 
proposition for D,, since the formula for 9,D,+9,D, will 
follow at once. 

We have written 

D. => Cap D +e? Ds, 
and have found e,,. To find e“?, we must get rid of the 
first term ; so we again use (2) of $6 (iii), getting 

(D,,.D,) = & (D,.D,) 
. _ ad 

or dae ae dys 
Hence, by reciprocation, 

28 = [d?x] m-k4ay 
where [d?*],,,_), is the reciprocal of d gy Obtained from the 
partial set (Dz.4, Dy. .++ Dy). The complete expression 
for D, is therefore 

Dy = (Aga) ¢P? + [On edayDy. (TX. 7. 2) 

The existence of (dg,);, and [d?*],,_;, is dependent on the 

assumption that the determinant | d,, | formed for D,, 


and the determinant | ¢,,. | formed for Dy; are both 4 0 


ay gE 
(see § 1 (vin)). 
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(iv) To use the second method, we might have proceeded 
as follows. We write 


Le Lela 
To find e,,, we express D® in terms of D I 1,6. “of D and Dy; 
by means of (IX. 3. 4), and we have 
LD een Ga) 
= eg D. +e,g0? Dy. 
From the condition stated in (2) of (i), it follows that 
eqpt” = 2, 
and therefore, by reciprocation, 
Cag = (“pyali = Gea) k- 
Hence we get the same result as before, namely, 


ID, = (d,,), D*. 


(v) For the third method, we write 
ID, = D,-€? Dg, 
and we have to find e*?, The condition stated in (3) of (i), 
namely, 
(ZD,.Dy) = 
gives 
Gea Dye DD ed. 
This is true for all the m—4 values of y. By reciprocation 
oe a [ax] m—k day . 


This agrees with (IX. 7. 1) and (IX. 7. 2). As m—A will 
usually be a good deal greater than 4, the method is rather 
of theoretical than of practical interest. 


(vi) The elements which we have found to be important 
in the above processes are the m—4 auxiliaries D,, whose 
improved values are all 0, and the & elements D* of the 
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conjugate set which correspond to the remainder of D,. 
These elements together constitute a set of order m; and 
we have in fact, in (iii), expressed D, in terms of this 
set. As the set is important, it is worth while to see 
what is its conjugate. 
We write 
P=aDsé& Ds; 

where the ‘&’ means that the elements of the two sets of 
orders 4 and m—s are combined to form a set of order m. 
These two partial sets are statistically independent. It 
follows, by § 3 (ix), that the set conjugate to E’ is 


Ey = (dg), DP? & [2 m-14Dy- 


(vii) But (d,g and d,, being identical) we have already 
found that 
ID, = (dy) 7,D*. 
Hence we get a concise formula for finding 7D,. Let the 
set conjugate to D, & D, be D* & D®; and let the set 
conjugate to J" & D, be F, & FY, Thn =i; 


(vill) Since JD, is of the form D,—e? Dy, and JDz is of 
the form €gD", and (D’. D5) = 0, it follows that 
(LD) LD) = (D,.1D,), 
and similarly 
(ID, ID) (ID; Dz). 


[NorE.—This chapter is based on (1) a paper by myself in Phil. 
Trans. (1920), ser. A, vol. 221, pp. 199-237, in which the old nota- 
tion was used ; (2) a paper by Professor Eddington in Proceedings 
of the London Mathematical Society, ser. 2, vol. 20, pp. 213-221, show- 
ing how the notation and methods of the tensor calculus can be 
apphed, and making some abbreviations and improvements in the 
work; and (3) a note by myself, following the above, ibid., pp. 222- 
224, I have altered the notation a good deal.] 


X. TENSORS IN THEORY OF RELATIVITY 


X. 1. Preliminary.—(i) Tensors are sets* which (1) are 
functions of a set of co-ordinates (7, 7, v,...) and (2) are 
subject to certain conditions of transformation when the 
co-ordinates are transformed. 


(ii) In the theory of relativity there are four co-ordinates 
(2, @, #3 4), So that all the sets are of order 4, and any inner 
multiplication with regard to a suffix p involves addition 
of the products for the values 1, 2, 3,4 of 4. But this fixing 
of the number of elements in a set does not affect the 
general reasoning with regard to the sets, and we can 
continue to treat them as of order m. 


(iii) In VII. 4 we started with a set X*, and a set A* 
which is a definite function of X*, and we supposed 4* to 
be changed as the result of X* being changed by linear 
substitution; and the cases we considered were those in 
which, throughout all such changes, 4 varies either directly 
or reciprocally as X*. In VII. 7 we extended the inquiry 
by taking a set @ to be a function of two or more single 
sets, and considered cases in which, when these sets are 
changed by linear substitution, @ varies directly or reci- 
procally as each of the sets. For tensors we have tc 
consider cases in which the primary substitutions are not 
necessarily linear. If in place of the original set of co- 
ordinates 7, we take a new set 2’,, which is a function 
but not necessarily a linear function of #,, the ratio which 


* It must be remembered that the elements of a set are not 
necessarily numbers, but may be quantities; and that a set as 
a whole is something different from its elements. What we 
usually mean by a tensor is some physical phenomenon represented 
by a set: but no confusion arises if we call the set itself a tensor. 
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we have now to consider is not the ratio of #, to # but 


the ratio of their differentials, i.e. the partial differential 
coefficient da’, /dz,. When the substitution is linear, this 
is equal * to x /e,, so that our treatment of the general 
case is consistent with our previous treatment of the par- 
ticular case. 

We will begin with the single set, and then go on to sets 
of higher rank. 


(iv) In the case of sets of higher rank, we sometimes 
have to deal not only with inner products but also with 
inner sums. By the inner sum, in such an expression as 
At, we mean the result obtained by replacing p by o 
and summing the values for c= 1, 2, 3,4. It will be seen 
presently (§ 5 (iv)) that, as the result of the particular 
notation adopted, the inner products or sums have only to 
be considered when one of the two letters concerned is an 
upper suffix and the other is a lower suffix. 


X. 2. Single sets (vectors).—(i) Beginning with single 
sets, we start with a pair which we call z, (the set of 
co-ordinates) and 4’, or #, and 4*; 4* or A* being a fune- 
tion of v, or x, (The a here, like the A, denotes a complete 
set, not, as in Chapter IX, a partial set.) Connected with 
these, or arising out of them, isa plurality of pairs 2’, and 4”. 
But our purview is limited to the cases in which the relation 
between 4* and 4” is linear, and in which, further, this 
linear relation is of one of the two following kinds: 

(a) where 


* The difficulty mentioned in the note to VIII. 1 (ii) does not 
arise, because 0x’,/da) does not occur absolutely, but (directly or 
reciprocally) as one of the factors in inner multiplication with 
regard to A or p, 
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(2) —replacing 4* and A” by A, and 4’,—where 
AR DSns 
Ai ee” 
In the cases under (2) «* is said to be a contravariant 
vector ; in the cases under (4) A, is said to be a covariant 


vector. Here ‘vector’ is used as meaning a single set 
which is a tensor. 


(11) It will be seen from VIII. 1 (ii) that, if the relation 
between x, and #’, is linear, these become respectively 


a 4, 
er! 
ety 
Aaa 


so that in these particular classes of cases A’ is contravariant 
: x c ; ; é 

if 4° and #, are cogredient, and A) is covariant if A, and x, 
are contragredient.* 


X. 3. Other sets.—(i) For sets of higher rank, we are 
similarly concerned with pairs of partial derivatives 


dig Oa" 
etc.; and a set depending on w,,#g,... is not a tensor unless, 
when #,%g... become 2’,2’,..., the ‘ratio’ of the new 
value of the set to the old value is the product of these 
partial derivatives, one from each pair. The particular 
derivatives are indicated by the position of the letters 
af... or py...: these are upper suffixes if, so far as the 
particular variable is concerned, the relation is of the 


* Tt seems desirable to call attention to these classes of cases, as 
otherwise the tensor terminology may be found rather confusing. 
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contravariant type, and lower suffixes if the relation is 
of the covariant type. Thus for double sets (tensors of the 
second rank) we should have such relations as 


hE / da” : 
ea EES (contravariant tensor), 
AB 9, 9% 


A, 28, Ip 
Ain ey oz, 
AX du, 4 


, 
=.—* —* (mixed tensor) ; 
AB On, 0%, 


(covariant tensor), 


and for a tensor of higher rank we might have such a rela- 
tion as 

Ay _ 2, By My de’ 

a on, On" oa", dz, 
Two tensors @ and 36 are said to be of the same character 
if the ratios A’/A and 38’/% are of the same form. 


(ii) For a scalar function of a set or sets the above 
condition becomes 


a. 
so that a scalar (in the general sense) is not a tensor unless 
it remains constant for all changes in the system of co- 
ordinates. Such a function as 4, + 4,+ 4, + A,, for instance, 
would not usually be a tensor. For tensor purposes, there- 
fore, ‘scalar’ practically means ‘invariant’. 


X. 4. Reason for limitation.—The object of limiting 
the definition of ‘tensor’ in this way is to ensure that the 
result of any number of steps, all of the same kind, pro- 
duced by successive transformations of co-ordinates, shall 
be the same as if we had passed in one step, also of the 
same kind, from the initial set of co-ordinates to the final 
set. That this is in fact ensured is seen from the properties 
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of the partial derivative of a set. Suppose, for example, 
that 


A*® 23a; 

a So? ”) 
and that : 

on = ae (2) 

: a 


Then, by (VII. 2. 2) and (VIII. 3. 1), 
MG eran. OER On. 
LAAT ee One 

which is of the same form as (1) and (2). 


a (3) 


3 
OL, 


X. 5. Miscellaneous properties.—The following are 
some miscellaneous properties which are useful in deter- 
mining whether a set is a tensor. 


(i) The sum (or difference) of two tensors of the same 
rank and character and the same suffix is a tensor. 
[Suppose, for instance, that 4* and B* are contravariant 
tensors. Then 
Ah = OU a ge BP = 22a pa 
orn np a 


and therefore 
oa" 
(4B) = 2 (t+ BY | 
(ii) The product of two tensors is a tensor whose character 


is the combination of the characters of the two. For example, 
from 


on Oar" on", 
ee SO ae 
r B 1 
we see that 
dar, ay da", - 
(AG BON aie Ee), 


02, dag Qn, 
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(iii) An inner product of two tensors, or an inner sum 
($1 (iv)) of a tensor, taken with regard to suffixes of 
opposite character, is a tensor. 

[Take, for example, 
oz, 7 day d2"p 


A’? am WOE aed 
vd x’, da,/ da" 7 ots aby" 


If we replace p by v we have, by (VIII. 3. 3), 


dx, 32’, 
On”, 3a, 


See Ae es: 


and therefore 
A” = P av. P) 0% Y 
Me Salsa Ne 


which satisfies the requirements. | 


(iv) If in this last example we had replaced w by A, 
instead of p by v, the expression would have contained 


oz, oe 
on", r Ou’ Oh 


which has no general significance. 


(v) The derivative of a scalar (¢ 3 (ii)) is a covariant 
vector. Suppose, e.g., that 4 is a scalar function of a, 
whose value remains constant (§ 3 (ii)) for all transforma- 
tions. Then 

dd’ dd du, dA 


on, dar, «da, on 


so that 04/02, is a covariant vector. 


(vi) If the inner product of a set A by each of m (=4) 
contravariant 


¢ t vectors is a tensor, then @ is a tensor, and 
covariant 
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covariant 

} contravariant 
suffix. 

[Take the case in which the vectors are contravariant, 
and suppose that @=4 ies Let the m vectors be 
denoted by B, B*4,.,..B™, or, collectively, by B\”; the A 
not indicating any tensor character. Then, by hypothesis, 
B*¥ is a tensor as regards Lys and aA ... 18 a tensor as 
regards x,..., so that 


} as regards @,, where uw is the linked 


Ox On 
(st Ses oe MX 4! 5), AB 
B= yn ey, ee B Sg negate Ag... 


From these we ieee, 


02 OL 
IAL yt == sd fh B ry. 
B A on Oa Bryese? 


whence, by division by B” (see VI. 9 (v)), 


A’ = —..A 
WVaee 7 P DOO ASH A0 Y 
Ou" , On", 
Hence 4,,,,, is a tensor and is covariant as regards ~,. 


The case in which the vectors are covariant can be dealt 
with in the same way. | 
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APPENDIX 
PRODUCT OF DETERMINANTS 


In IV. 5 we have taken as the standard form for product 
of two determinants—the order being now reduced from 3 
to 2, for economy of printing— 

X | 4 P; 410, +4,8, 6,0,+628; |. 

ay Bo MyM, +4282 O42 +428. 

In this form, the element in the gth column and 7th row 
of the result is the ‘inner product’ of the gth column of 
the first determinant and the 7th row of the second. By 
interchanges of columns and rows, and also by changing 
the order of multiplication, we get seven other forms, all 
constructed according to this rule. The eight forms can 
be set out as follows : 


a,b, 


ty by 


Gtx 


a, By 


Qa, +428, 


by ay +558 


1 

Anh, A935 a, Ay +485 ee «) 
a,6,|X| a,a,g | =| a,a,+ 4,0, ey (2) 
Ag by B, Bo 4B, +4282 4,8, + by By z 

M44 |X | a8, | =| a, +413; aga, +44 (3) 
O14, Ay By | | A102 +4, By Ad. +b, By 

MyM, X| a a2| =| aa tha, aa, +40, | (4 

b,5,| | 8, Be By +4 By M8, + FB. 
a8, |X | a4, | =| qa, +40, 4,8, +4;8, (5) 
A 3, ab, G20, +4,a, a8, +4, , | 

a,a, |X| ah) = | aa, +468, a,4,4+4, 8, 6 

B,By| | ay, G0, +028) aya, +48, (8) 
@,)9,| X | 4%) =| a, +4,a, 4,8, +48, 7 

ay Bo b, by ba, +a. 4, 8, +4, By | ”) 
Q, A |X| a, a, | = | aa, +4,8; 4,a,+4,8, (3 

cPace b, 4, 6,0, +6,8, b,a,+4, By ) 
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It will be seen that the last four are the transposed of the 
first four, but in the reverse order; i.e. the transposed of 
(1) (2) (8) (4) are (8) (7) (6) (5). 

In the double-suffix notation these become, the order 
of (5)—(8) being reversed : 


ldo, | x | Cor |= | CONG es ae erg | x | rg |=| On exq | (8) 
| dor | X | erg |= 1% nen | (2) Lege |X | rq |= 1 4raeqa | (7) 
| deg |X |egr |=] 4agerr | (3) lrg |X | gr |= 1 areag | (6) 
lrg |X | lrg |=|Eqeral (4) Lege | X | gr |=] xrega | (5) 
It must be borne in mind that in each of these statements 
q vefers to the column and 7 to the row; e.g. (6) means 
that 

x 


Ay, 
Ay oo 


11%12 By €y1 Ey eyo | 


€5 C59 Bno€y I2ln2 


INDEX OF SYMBOLS 


|dy,| determinant 39 

dps (in Chapter V) ratio of cofactor of dy, to | dg,| 42 

A, single set 44, 58 

Ap double set 45, 59 

Ap, transposed of Ay, 45, 59 

B,C, product-sum (inner product) of B, and C, 47, 62 
* unit set 50, 64 

@ set generally 60 

Aw product of 4 and % 61 

Aj, Byp inner product of Ay, and B,,, 63 

APH inverse of A, 65 

A? reciprocal of Aj, 67 

(AY), inverse of partial set 4,, 70 

[A¥?]in_z inverse of partial set Agy 70 

B,/A* ratio of B, to AY 76 

%3/G ratio of % to A 82 

0%3/0 A) derivative of 33 with regard to A, 85, 86 

(X,.X;,) mean product of deviations of X, and X,. 94 

X* set conjugate to X, 97 

IU improved value of U 106 

Afyo inner sum derived from Af, 116 

A* contrayariant vector, A, covariant vector 117 


GENERAL INDEX 


Under any heading, separate entries are usually in the order of occurrence, 


not in alphabetical order. 


Addition of sets 61 

Adjoint determinant 37; cofactor 
of element of 38 

Auxiliaries 106 


Character of tensor 118 

Cofactor 24; expressed as deter- 
minant 25 

Cofactors, determinant 
structed from 36-38 

Cogredience 79, 84; of statistical 
set and improved values 109; 
relation to contravariance 117 

Cogredient sets 79, 84 

Column of double set 15, 45; of 
determinant 21 

Complete inner product 62 

Conjugate set 97; determination 
of 98-99 ; original set as conju- 
gate of 98; of inner product 101 

Conjugate sets with linear rela- 
tions 100 

Continued inner product 68; 
transposed of 69 ; inverse of 69; ' 
reciprocal of 69 

Contragredience 79, 84; with 
linear relation 80; of variables 
and their partial differential 
operators 88; of conjugate sets 
101-102; relation to covariance 
117 

Contragredient sets 79, 84 

Contravariance, relation of, to 
cogredience 117 

Contravariant vector 117; tensor 
118 

Covariance, relation of, to contra- 
gredience 117 

Covariant vector 117; tensor 118 


con- 


Derivative of set 85; of sum or 
product of sets 86; of function 
of a set 87 

Determinant 21; notation 21, 23, 
39; elements 21; column 21; 
row 21; leading diagonal 21 ; 
leading term 21; order of 21; 
calculation of 32-33 (See also 
separate headings below) 

Determinant, construction of:— 
factors of terms 14-16; rule of 
signs 16-19; sign of term 
dependent on number of re- 
versals of order 17-18; sign of 
term changed by interchange 
of suffixes 18; effect of altering 
order of letters 18-19; final 
definition 20-21 

Determinant, main properties of, 
in ordinary notation 40; in 
tensor notation 52 

Determinant, properties of :— 0 if 
each element of column or row 
is 0 22; not altered by inter- 
change of columns and rows 22; 
sign changed by interchanging 
two columns [rows] 23; 0 if 
two columns [rows] identical 
23; expression in terms of 
elements of column [row] and 
their cofactors 24, 26, 28; sum 
of products of elements of 
column [row] by cofactors of 
parallel elements is 0 27, 28; 
multiplication by single factor 
31; not altered by increasing 
elements of column [row] by 
multiples of parallel elements 
31 


126 


Determinants, sum of 31; pro- 
duct of, in ordinary notation 
38-36; product of, in tensor 
notation 48, 49; forms of pro- 
duct of 122 

Direct proportion 76, 84 

Dummy suffix 48, 62 


Elements of double set 15, 45, 59; 
of determinant 21; of single 
set 44, 58 

Error, reduction of 105 


Fitting 93 

Free suffix 48, 62 

Frequency-quadratic 102; mean- 
ing of coefficients in 103 

Functional relation between sets 
73 


Gothic letters 60 

Graduation 93 

Greek letters for sets 43 seqq.; for 
product-sums 47 


Improved values 106; criteria for 
105-110 ; determination of 110- 
114 

Independence, algebraical 73 n. ; 
statistical 96 

Independent observations, set 
constituted by 97, 110 

Inner multiplication 49 

Inner product 49, 62, 63; trans- 
posed of 63; rule for construc- 
tion of 64 

Inner product, complete 62 

Inner product, continued : see 
Continued inner product 

Inner sum 116 

Inverse of double set 653 con- 
dition for 65-66 ; of transposed 
set 67; original set as inverse 
of inverse set 67 


Jacobian 88 
Key set 20 


Leading diagonal of double set 
15; of determinant 21 


General Index 


Leading term of double set 16; 
of determinant 21 

Linear function of single set 74 

Linear relation between single 
sets 73; between sets generally 
82 

Linked suffix 48, 62 


Matrix of determinant 59n. 

Mean product of deviations, nota- 
tion for 94; algebraical laws 
satisfied by 94 

Mean-product set 96; of self- 
conjugate set, is unit set 97 
(See also separate headings below) 

Mean-product set, reciprocal of 
96; is mean-product set of 
conjugate set 99 

Mean-product set for partial varia- 
tion 104 

Minor 25; relation of, to co- 
factor 25-26 

Mixed tensor 118 

Moments, extension of method of 
109-110 

Multiplication of set by scalar 61; 
by set 61 


Notation : see INDEX oF SYMBOLS 


Order of determinant 21 ; of single 
set 58; of double set 59; of set 
generally 60 

Outer product 49, 61 


Partial differential coefficient, 
reason for appearance of 115-116 

Partial double set 70; inverse or 
reciprocal of 70 

Partial single set 70 

Product, inner: see Inner product 

Product, outer 49, 61 

Product of determinant by single 
factor 31; of two determinants, 
see Determinants; of two sets 
45-46, 61 

Product-sum notation 47 

Proportion, direct 76, 84; reci- 
procal 77, 84 


Quadratic form, expressed as inner 
product of two single sets 81; 


General Index 


derivative of 86; expressed as 
sum of squares 88-89 


Rank of set 60 

Ratio, as operator 76; of single 
sets 76; of sets generally 81, 
82; reciprocal of 81, 82 

Ratios, equal 75, 76, 82, 83; in- 
version of 76, 78; cross-multi- 
plication of 78 

Reciprocal determinant 41-42 

Reciprocal of double set 67; 
original set as reciprocal of 
reciprocal set 67 

Reciprocal proportion 77, 84 

Reciprocation 67 

Reduction of error 105 

Relativity and statistical theory, 
similarity of methods for 93 

Reversals of order 17 

Row of double set 15, 45; of 
determinant 21 


Scalar 46, 60; as invariant 118; 
derivative of 120 

Self-conjugate set 100 

Set, double 15, 45, 59; deter- 
minant of 59; transposed of 
59; inverse of 65; reciprocal 
of 67 

Set, single 44, 58; as sum of com- 
ponent single sets 71 

Set-notation 42; principles of 44 

Sets, functional relation between 
73 

Simultaneous equations:—method 
ofindividual solution 11; formu- 
lae for particular cases 12-13, 
29; statement of general prob- 
lem 13-14; trial formula for 


127 


denominator 
solution 30 

Statistical independence 96 

Statistical set, nature of 92-93 

Substitution-operator, unit set as 
64-65 

Successive summations, order im- 
material 48 

Suffixes, upper 47 

Sum of determinants 31 

Symbols: see InpEx or SymBoxs 

Symmetrical double set 45, 60; 
identity of inverse and reci- 
procal sets 67 


14-19; general 


Tensor 115; conditions for 116- 
118; contravariant 118; co- 
variant 118; mixed 118; reason 
for limitation 118 

Tensors, properties of 119-121; 
sum or difference 119; product 
119; inner product or inner 
sum 120 

Transposed of determinant 22; of 
double set 59; of inner product 
63; of inverse set 67; of reci- 
procal set 67 


Unit set 51, 64; properties of 49- 
51, 64-65; symmetry of 51, 64; 
action as substitution-operator 
64-65; inner product of two 
unit sets 65 


Variable sets 72 

Variation, direct 79, 84; reci- 
procal 79, 84 

Vector, ordinary, relation tosingle 
set 46 n., 59 

Vector, tensor 117 
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